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Abstract. We study the limit theory of large threshold graphs and 
apply this to a variety of models for random threshold graphs. The 
results give a nice set of examples for the emerging theory of graph 
limits. 



1. Introduction 

Threshold Graphs. Graphs have important apphcations in modern sys- 
tems biology and social sciences. Edges are created between interacting 
genes or people who know each other. However graphs are not objects 
which are naturally amenable to simple statistical analyses, there is no nat- 
ural average graph for instance. Being able to predict or replace a graph by 
hidden (statisticians call them latent) real variables has many advantages. 
This paper studies such a class of graphs, that sits within the larger class of 
interval graphs [27i, itself a subset of intersection graphs [H]; see also j6]- 
Consider the following properties of a simple graph G on [n] := {1, 2, . . . , n}. 

(1.1) There are real weights Wi and a threshold value t such that there is 
an edge from i to j if and only if Wi + wj > t. Thus "t/ie rich people 
always know each other'\ 

(1.2) G can be built sequentially from the empty graph by adding vertices 
one at a time, where each new vertex, is either isolated (non-adjacent 
to all the previous) or dominating (connected to all the previous). 

(1.3) The graph is uniquely determined (as a labeled graph) by its degree 
sequence. 

(1.4) Any induced subgraph has either an isolated or a dominating vertex. 

(1.5) There is no induced subgraph 2K2, P4 or C4. (Equivalently, there is 
no alternating 4-cycle, i.e., four distinct vertices x, y, z, w with edges 
xy and zw but no edges yz and xw; the diagonals xz and yw may 
or may not exist.) 

These properties are equivalent and define the class of threshold graphs. The 
book by Mahadev and Peled [25^ contains proofs and several other seemingly 
different characterizations. Note that the complement of a threshold graph 
is a threshold graph (by any of (1.1)| - (1-5) ). By (1.2)[ a threshold graph is 



either connected (if the last vertex is dominating) or has an isolated vertex 
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(if the last vertex is isolated); clearly these two possibilities exclude each 
other when n > 1. 




Figure 1. A threshold graph 



Example 1.1. The graph in Figure [T] is a threshold graph, from (1.1) by 



taking weights 1,5,2,3,2 on vertices 1-5 with t = 4.5, or from (1.2) by adding 
vertices 3, 5 (isolated), 4 (dominating), 1 (isolated) and 2 (dominating). 

While many familiar graphs are threshold graphs (stars or complete graphs 
for example), many are not (e.g. paths or cycles of length 4 or more). For 
example, of the 64 labeled graphs on 4 vertices, 46 are threshold graphs; 
the other 18 are paths P4, cycles C4, and pairs of edges 2K2 (which is the 
complement of C4). Considering unlabeled graphs, there are 11 graphs on 
4 vertices, and 8 of them are threshold graphs. 

Random Threshold Graphs. It is natural to study random threshold 
graphs. There are several different natural random constructions; we will in 
particular consider the following three: 



(1.6) From (1.1) by choosing {?i'j}i<j<n as independent and identically 
distributed (i.i.d.) random variables from some probability distribu- 
tion. (We also choose some fixed t; we may assume t = by replacing 
Wi by Wi — t/2.) 

(1.7) From [(Tl2) by ordering the vertices randomly and adding the vertices 



one by one, each time choosing at random between the qualifiers 
'dominating' or 'isolated' with probabilities pi and 1—pi, respectively, 
1 < i < n. This is a simple random attachment model in a similar 
vein as those in [3D]. We mainly consider the case when all pi are 
equal to a single parameter p E [0, 1]. 
(1.8) The uniform distribution on the set of threshold graphs. 

Example 1.2. Figure |2] shows a random threshold graph constructed by 



(1.6) with Wi chosen independently from the standardized Normal distribu- 
tion and t = 3. About half of the vertices are isolated, most of those with 
negative weights. 
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Figure 2. A whole threshold graph with isolates (left) and 
with only the connected part expanded (right); the labels are 
the rounded weights Wi. 



Example 1.3. Figure |3] shows a random threshold graph constructed by 



(1.6) with Wi chosen as i.i.d. uniform random variables on [0, 1] and t = 1. 



This instance is connected; this happens if and only if the maximum and 
minimum of the tUj's add to more than 1 (then there is a dominating vertex); 
in this example this has probability 1/2. 

We show below (Corollaries 6.5 and |6.6| ) that this uniform weight model 
is equivalent to adding isolated or dominating nodes as in (1.7) with prob- 
ability p = 1/2, independently and in random order. It follows that this 
same distribution appears as the stationary distribution of a Markov chain 
on threshold graphs which picks a vertex at random and changes it to dom- 
inating or isolated with probability 1/2 (this walk is analysed in |7]). Fur- 
thermore, it follows from Subsection |2.1| that these models yield a uniform 
distribution on the set of unlabeled threshold graphs of order n. 
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Figure 3. A threshold graph with n = 20 and uniform Wi. 
It turns out that this instance had no isolates. The labels 
are the rounded weights Wi. 

Bipartite Threshold Graphs. We also study the parallel case of bipartite 
threshold graphs (difference graphs), both for its own sake and because one 
of the main theorems is proved by first considering the bipartite case. 

By a bipartite graph, we mean a graph with an explicit bipartition of the 
vertex set; it can thus be written as (Vi, V2, -B) where the edge set E CViX 
V2 . These following properties of a bipartite graph are equivalent and define 
the class of bipartite threshold graphs. (See |25J for further characterizations.) 

(1.9) There are real weights w'^, i £ Vi and Wj, j £ V2, and a threshold 
value t such that there is an edge from i to j if and only if w'^+Wj > t. 
(1.10) G can be built sequentially starting from ni white vertices and re2 
black vertices in some fixed total order. Proceeding in this order, 
make each white vertex dominate or isolated from all the black ver- 
tices that precede it and each black vertex dominate or isolated from 
all earlier white vertices. 
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(1.11) Any induced subgraph has either an isolated vertex or a vertex dom- 
inating every vertex in the other part. 

(1.12) There is no induced subgraph 2K2. 

Remarks. 1. Threshold graphs were defined by Chvatal and Hammer [8]. 
Bipartite threshold graphs were studied by Hammer, Peled and Sun [T7] 
under the name difference graphs because they can equivalently be charac- 
terized as the graphs {V,E) for which there exist weights w^, v and a 
real number t such that \wv\ < i for every v and uv G E \wu—Wv\ > t; 

it is easily seen that every such graph is bipartite with Vi = {v : > 0} 
and V2 = {v : < 0} and that is satisfies the definition above (e.g., with 
w'^ = and w'l = —w^)^ and conversely. We will use the name bipartite 
threshold graph to emphasize that we consider these graphs equipped with a 
given bipartition. The same graphs were called chain graphs by Yannakakis 
|33] because each partition can be linearly ordered for the inclusion of the 
neighborhoods of its elements. 

2. A suite of programs for working with threshold graphs appears in [15] 
with further developments in |2H [26] . 

3. The most natural class of graphs built from a coordinate system 
are commonly called geometric graphs [31^ or geographical graphs |2H [26] . 
Threshold graphs are a special case of these. Their recognition and manip- 
ulation in a statistical context relies on useful measures on such graphs. We 
will start by defining such measures and developing a limit theory. 

Overview of the Paper. The purpose of this paper is to study the limiting 
properties of large threshold graphs in the spirit of the theory of graph 
limits developed by Lovasz and Szegedy |22j and Borgs, Chayes, Lovasz, 
Sos, Vesztergombi |5] (and in further papers by these authors and others). 
As explained below, the limiting objects are not graphs, but can rather 
be represented by symmetric functions W{x,y) from [0,1]^ to [0,1]; any 
sequence of graphs that converges in the appropriate way has such a limit. 
Conversely, such a function W may be used to form a random graph G„ by 
choosing independent random points Ui in [0, 1], and then for each pair (i, j) 
with 1 < i < j < n flipping a biased coin with heads probability W{Ui, Uj), 
putting an edge from i to j if the coin comes up heads. The resulting 
sequence of random graphs is (almost surely) an example of a sequence of 
graphs converging to W. For Example |1.3[ letting n 00, there is (as we 
show in greater generality in Section |6]) a limit W that may be pictured as 
in Figure |4} 



One of our main results (Theorems 5.3) shows that graph limits of thresh- 



old graphs have unique representations by increasing symmetric zero-one 
valued functions W. Furthermore, there is a one-to-one correspondence be- 
tween these limiting objects and a certain type of 'symmetric' probability 
distributions Pw on [0, 1]. A threshold graphs is characterized by its degree 
sequence; normalizing this to be a probability distribution, say v{Gn)-, we 



show (Theorem 5.5) that a sequence of threshold graphs converges to W 
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Figure 4. The function W{x,y) for Example 1.3 
values have W{x,y) = 1, unhashed W{x,y) = 0. 



Hashed 



when 71 ^ oo if and only if u^Gn) converges to Pw- (Hence, Pw can be 
regarded as the degree distribution of the limit. The result that a limit of 
threshold graphs is determined by its degree distribution is a natural ana- 
logue for the limit objects of the fact that an unlabeled threshold graph is 
uniquely determined by its degree distribution.) 

Figure [5] and Figure [7] show simulations of these results. In Figure |5] 
10,000 graphs with n = 50 were generated from |( 1.6) with uniform weights 
as in Example 1 1.3 1 

In the bipartite case, there is a similar 1-1 correspondence between the 
limit objects and probability distributions on [0,1]; now all probability dis- 
tributions on [0,1] appear in the representation of the limits (Theorem 5.1). 

Section |2] discusses uniform random threshold graphs (both labeled and 
unlabeled) and methods to generate them. Section [s] gives a succint review 
of notation and graph limits. Section [4] develops the limit theory of degree 
sequences; this is not restricted to threshold graphs. Section [5] develops the 
limit theory for threshold graphs both deterministic and random. Section [6 
treats examples of random threshold graphs and their limits, and Section 
gives corresponding examples and results for random bipartite thresho^ 
graphs. Section [9] treats the spectrum of the Laplacian of threshold graphs. 

We denote the vertex and edge sets of a graph G by V{G) and E(G), and 
the numbers of vertices and edges by v{G) := |V^(G)| and e{G) := \E{G)\. 
For a bipartite graph we similarly use Vj{G) and Vj{G), j = 1,2. 

Throughout the paper, 'increasing' and 'decreasing' should be interpreted 
in the weak sense (non-decreasing and non- increasing) . Unspecified limits 



are as n 



oo. 
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Figure 5. Threshold graphs were generated with n = 50 as 
in Example 1.3 with uniform Wi and t = 1; this is the degree 
histogram for a sample of 10,000 random graphs. 



2. Generating threshold graphs uniformly 

This section gives algorithms for generating uniformly distributed thresh- 
old graphs. Both in the labeled case and in the unlabeled case. The algo- 
rithms are used here for simulation and in Sections |6] and [7] to prove limit 
theorems. 

Let %i and £T„ be the sets of unlabeled and labeled threshold graphs on 
n vertices. These are different objects, 7^ is a quotient of £T„, and we treat 
counting and uniform generation separately for the two cases. We assume 
in this section that n > 2. 

2.1. Unlabeled threshold graphs. We can code an unlabeled threshold 
graph on n vertices by a binary code 02 • • • ctn of length n — 1: Given a code 
«2 ■ ■ ■ ttni we construct G by (1.2) adding vertex z as a dominating vertex if 
and only if Oj = 1 {i > 2). Conversely, given G of order n > 2, let a„ = 1 
if there is a dominating vertex (G is connected) and = if there is an 
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00 01 10 11 

Figure 6. The four graphs in 73 and their codes. 



isolated vertex {G is disconnected); we then remove one such dominating or 
isolated vertex and continue recursively to define On-ij ■ • • , 02- 

Since all dominating (isolated) vertices are equivalent to each other, this 
coding gives a bijection between Tn and {0, 1}"~^. In particular, 

\Tn\ = 2"-\ n > 1. 

See Figure [6] for an example. 

This leads to a simple algorithm to generate a uniformly distributed ran- 
dom unlabeled threshold graph: we construct a random code by making 
n — 1 coin flips. In other words: 

Algorithm 2.1. Algorithm for generating uniform random unlabeled thresh- 
old graphs of a given order n. 



Step 1: Add n vertices by (1.2) each time randomly choosing 'iso- 



lated' or 'dominating' with probability 1/2. 

This is thus the same as the second method in Example |1.3[ so Corol- 
lary |6.5| shows that the first method in Example |1.3| also yields uniform 
random unlabeled threshold graphs (if we forget the labels). 

The following notation is used to define two further algorithms (Subsec- 
tion 2.3) and for proof of the limiting results in Section [7j 

Define the extended binary code of a threshold graph to be the binary code 
with the first binary digit repeated; it is thus aia2a3 • • • «n with qi := 02. 
The runs of O's and I's in the extended binary code then correspond to 



blocks of vertices that can be added together in (1.2) as either isolated or 
dominating vertices, with the blocks alternating between isolated and dom- 
inating. The vertices in each block are equivalent and have, in particular, 
the same vertex degrees, while vertices in different blocks can be seen to 
have different degrees. (The degree increases strictly from one dominating 
block to the next and decreases strictly from one isolated block to the next, 
with every dominating block having higher degree than every isolated block; 



cf. Example 2.3 below.) The number of different vertex degrees thus equals 



the number of blocks. 
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If the lengths of the blocks are 61, 62, • • • , ^r, then the number of auto- 
morphisms of G is thus nj=i ^i'l since the vertices in each block may be 
permuted arbitrarily. 

Note that if 61 , . . . , 6,- are the lengths of the blocks then 

r 

61 > 2, bk>l{k>2), Y,bk = n. (2.1) 

k=l 

Since the blocks are alternatingly dominating or isolated, and the first block 



may be either, each sequence bi, . . . ,br satisfying (2.1 ) corresponds to ex- 
actly 2 unlabeled threshold graphs of order n. (These graphs are the comple- 
ments of each other. One has isolated blocks where the other has dominating 
blocks.) 

2.2. Labeled threshold graphs. The situation is different for labeled 

threshold graphs. For example, all of the 2(2) = 8 labeled graphs with 
n = 3 turn out to be threshold graphs and for instance 



2 2y 



are distinguished. Hence the distribution of a uniform random labeled 
threshold graph differs from the distribution of a uniform unlabeled thresh- 



old graph (even if we forget the labels). In particular. Example 1.3 does not 
produce uniform random labeled threshold graphs. 

Let G be an unlabeled threshold graph with an extended code having 
block lengths (runs) 61, ... , br- Then the number of labeled threshold graphs 
corresponding to G is 'T-'/rii^j'j since every such graph corresponds to a 
unique assignment of the labels 1, . . . ,n to the r blocks, with bi labels to 
block i. (Alternatively and equivalently, this follows from the number YYi bjl 
of automorphisms given above.) 

The number t{n) := |£T„| of labeled threshold graphs [32 , A005840] has 
been studied by Beissinger and Peled |2j- Among other things, they show 
that 
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and by expanding the singularities (cf. [I2l Chapter IV]) the exact formula 
'("^ - y ( - l) (- '—Y, n > 2, (2.3) 



n\ ^ \log2 + 27riA; / Vlog 2 + 27riA; 

k=—oo 

where the leading term is the one with A; = 0, and thus the asymptotics 

i)f^r+^(»). kwis"^. (2.4) 



n\ Vlog2 / Vlog2/ ' ' I V /I - (2vr) 
where C('^) is the zeta function and thus C{n) 1; furthermore, 

n oo ^„ 

t{n) = 2Rn - 2nRn-i, n>2, with Rn = Yl ^'•^("' ^) = 2^ (2-5) 

k=i e=o 

where S{n, k) are Stirling numbers; Rn is the number of preferential arrange- 
ments of n labeled elements, or number of weak orders on n labeled elements 
|32| A000670], also called surjection numbers [12, II. 3]. (This is easily seen 
using the blocks above; the number of labeled threshold graphs with a given 
sequence of blocks is twice (since the first block may be either isolated or 
dominating) the number of preferential arrangements with the same block 
sizes; if we did not require bi > 2, this would yield 2Rn, but we have to 
subtract twice the number of preferential arrangements with 6i = 1, which 
is 2ni2„„i.) We note for future use the generating function [12, (11.15)] 

^-^ n! 2 — 

n=0 

Let t{n,j) be the number of labeled threshold graphs with j isolated 
points. Then, as also shown in |2j (and easily seen), for n > 2, 

t{n,0) = t{n)/2, 

'(pt(n-j,0) = iQ)t(n-i), 0<j<n-2, 
t{n,j)=L, j = n-l, (2.7) 

,1, j = n. 

Thus knowledge of t{n) provides t{n,j). 

These ingredients allow us to give an algorithm for choosing uniformly in 

Algorithm 2.2. Algorithm for generating uniform random labeled threshold 
graphs of a given order n. 

Step 0: Make a list of t{k) for k between 1 and n. Make lists of t{k, j) 
for A; = 1 , . . . , n and j = 0, . . . , k. 

Step 1: Choose an integer Jq in {0, . . . , n} with probability that jo = 
j given by t{n, j) /t{n). Choose (at random) a subset of jo points 
in {l,...,n}. These are the isolated vertices in the graph. Let 
n' := n — jg be the number of remaining points. If n' = then stop. 
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Step 2: Choose an integer ji in {1, . . . , n'} with probabihty that ji = 
j given by t{n\ j) / {t{n') — t{n',0)) = 2t{n\ j) /t{n') and choose (at 
random) j'l points of those remaining; these wih dominate all further 
points, so add edges between these vertices and from them to all 
remaining points. Update n' to n' — ji, the number of remaining 
points. If n' = then stop. 

Step 3: Choose an integer j2 in {1, . . . , n'} with probability that j2 = 
j given by 2t{n' , j)/t{n') and choose (at random) j2 points of those 
remaining; these will be isolated among the remaining points, so 
no further edges are added. Update n' to n' — ji, the number of 
remaining points. If n' = then stop. 
Step 4: Repeat from Step 2 with the remaining n' points. 

Alternatively, instead of selecting the subsets in Steps 1 and 2 at random, 
we may choose them in any way, provided the algorithm begins or ends with 
a random permutation of the points. 

The algorithm works because of a characterization of threshold graphs 
by Chvatal and Hammer j^, cf. (1.4)[ A graph is a threshold graph iff any 



subset S of vertices contains at least one isolate or one dominating vertex 
(within the graph induced by S). Thus in step 2, since there are no isolates 
among the n' vertices left there must be at least one dominating vertex. 
(Note that jo may be zero, but not ji, j2 . . . .) The probability distribution 
for the number of dominating vertices follows the same law as that of the 
isolates because the complement of a threshold graph is a threshold graph 
(or because of the interchangeability of O's and I's in the binary coding given 
earlier in this section). 



Note that this algorithm treats vertices in the reverse of the order in (1.2) 
where we add vertices instead of peeling them off as here. It follows that we 
obtain the extended binary code of the graph by taking runs of jo O's, ji I's, 
j2 O's, and so on, and then reversing the order. Hence, in the notation used 
above, the sequence (6^) equals (jk) in reverse order, ignoring jo if jo = 0. 
(In particular note that the last > 2, since t{n',n' — 1) = for n' > 2, 
which corresponds to the first block 6i > 2.) 

Example 2.3. A sequence of js generated for a threshold graph of size 20 is 
023111311311 2, which yields the sequence ddiiidid 
iiididddidiiof dominating and isolated vertices. A random 
permutation of {1, ... , 20} was generated and we obtain 

13 2 11 15 8 20 6 12 16 4 18 7 10 9 14 17 1 19 5 3 
dd i ii di d ii id id d di dii 

where d signifies that the vertex is connected to all later vertices in this list. 
The degree sequence is thus, taking the vertices in this order: 19, 19, 2, 2, 
2, 16, 3, 15, 4, 4, 4, 12, 5, 11, 11, 11, 8, 10, 9, 9. The extended binary code 
00101110100010100011 is obtained by translating i to and d to 1, and 
reversing the order. 
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6 13 21 29 37 45 53 61 69 77 85 93 



Figure 7. Threshold graphs were generated according to 
the algorithm of this section, this is the degree histogram. 

In Figure [7] 10,000 graphs were generated with n = 100 according to 
the uniform distribution over all labeled threshold graphs. We discuss the 
central 'bump' and other features of Figure [7] in Theorem 7.4 



2.3. The distribution of block lengths. We have seen in Subsection 2.1 
that if bi, . . . ,br are the lengths of the blocks of isolated or dominating 



vertices added to the graph when building it as in (1.2) then (2.1) holds. 
Consider now a sequence of independent integer random variables Bi,B2, ■ ■ ■ 
with Bi > 2 and Bj > 1 for j > 2, and let Sk '■= X]j=i be the partial 
sums. If some Sr = n, then stop and output the sequence {Bi, . . . ,Br). 
Conditioning on the event that Sr = n for some r, this yields a random 
sequence bi,...,br satisfying (2.1), and the probability that we obtain a 
given sequence (6j)i equals c]^J^^ ^{^j — ^j) fo^' some normalizing constant 

c. We now specialize to the case when Bi = {B* \ B* > 2) and Bj = 
{B* I i?* > 1) for j > 2, for some given random variable B* . Then the 



(conditional) probability of obtaining a given bi, . . . ,br satisfying (2.1 ) can 



THRESHOLD GRAPH LIMITS 



13 



be written 

JLi r{B* > 1) ^ ^ 

(with c' = c¥{B* > 1)/F{B* > 2)). 

There are two important cases. First, if we take B* ~ Ge(l/2), then 
= bj)/F{B* > 1) = 2-''i, and thus Q yields c'l'^J^^ = c'2-", so 



the probabihty is the same for all allowed sequences. Hence, in this case the 
distribution of the constructed sequence is uniform on the set of sequences 



satisfying (2.1), so it equals the distribution of block lengths for a random 
unlabeled threshold graph of size n. 

The other case is B* ~ Po(log2). Then F{B* > I) = I - 6'^°^^ = 1/2, 
and F{B* = bj)/F{B* > 1) = {log2)''^ /bjl. Thus, yields the proba- 

bility c'(log 2)"/ Y\j bjl, which is proportional to the number 2- n\/ Y\j bjl of 
labeled threshold graphs with the block lengths bi,. . . ,br- Hence, in this 
case the distribution of the constructed sequence equals the distribution of 
block lengths for a random labeled threshold graph of size n. 

We have shown the following result. 

Theorem 2.4. Construct a random sequence Bi, . . . , Br as above, based 
on a random variable B* , stopping when Bj > n and conditioning on 
EI = n. 

(i) If B* ^ Ge(l/2), then {Bi, . . . , B^-) has the same distribution as the 
block lengths in a random unlabeled threshold graph of order n. 

(ii) If B* Po(log2), then (i?i, . . . , 5^) has the same distribution as 
the block lengths in a random labeled threshold graph of order n. 

It follows that the length of a typical (for example a random) block con- 



verges in distribution to {B* \ B* > 1). Theorem 2.4 also leads to another 



algorithm to construct uniform random threshold graphs. 

Algorithm 2.5. Algorithm for generating uniform unlabeled or labeled thresh- 
old graphs of a given order n. 

Step 1: In the unlabeled case, let B* ~ Ge(l/2). In the labeled case, 

let B* ~ Po(log2). 
Step 2: Choose independent random numbers Bi, B2, ■ ■ ■ , B^-, with 

Bi = {B* \ B* > 2) and Bj = {B* \ B* > 1), j > 2, until the 
sum Bj > n. 
Step 3: If Bj > n, start again with Step 2. 

Step 4: We have found Bi, . . . ,Bt- with ^[ Bj = n. Toss a coin to 
decide whether the first block is isolated or dominating; the following 
blocks alternate. Construct a threshold graph by adding vertices as 
in 



1.2) block by block. 



Step 5: In the labeled case, make a random labeling of the graph. 

By standard renewal theory, the probability that Yl\ Bj is exactly n is 
asymptotically 1/E(S* \ B* > I) = F{B* > 1)/EB*, which is 1/2 in the 
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unlabeled case and 1/(2 log 2) « 0.72 in the labeled case, so we do not have 
to do very many restarts in Step 3. 



This section reviews needed tools from the emerging field of graph limits. 

3.1. Graph limits. Here we review briefly the theory of graph limits as de- 
scribed in Lovasz and Szegedy [22], Borgs, Chayes, Lovasz, Sos and Veszter- 
gombi [5] and Diaconis and Janson |10j . 

If F and G are two graphs, let t{F, G) be the probability that a ran- 
dom mapping (p : V{F) V{G) defines a graph homomorphism, i.e., that 
(j){v)(j){w) G F{G) when vw S E{F). (By a random mapping we mean a 
mapping uniformly chosen among ah v{GY^^^ possible ones; the images of 
the vertices in F are thus independent and uniformly distributed over V{G), 
i.e., they are obtained by random sampling with replacement.) 

The basic definition is that a sequence Gn of (generally unlabeled) graphs 
converges if t(-F, Gn) converges for every graph F; as in [TU] we will further 
assume v{Gn) oo. More precisely, the (countable and discrete) set U of all 
unlabeled graphs can be embedded in a compact metric space U such that 
a sequence Gn &U oi graphs with w(G„) oo converges in U to some limit 
r G Z// if and only if t{F, Gn) converges for every graph F (see |22], |5], [TO]). 
Let Uoo := U \ U he the set of proper limit elements; we call the elements of 
Uoo graph limits. The functionals t{F, ■) extend to continuous functions on 
U, so Gn ^ r G Uoo if and only if v{Gn) oo and t{F, Gn) — > t{F,T) for 
every graph F. 

Let W be the set of all measurable functions W : [0,1]^ [0)1] and 
let Ws be the subset of symmetric functions. The main result of Lovasz 
and Szegedy [22] is that every element of Uoo can be represented by a (non- 
unique) function W G Wj. We let T\y G Uoo denote the graph limit defined 
by W. (We sometimes use the notation r(I^) for readability.) Then, for 
every graph F, 



Moreover, define, for every n > 1, a random graph G{n,W) as follows: 
first choose a sequence Xi,X2, ■ ■ ■ ,Xn of i.i.d. random variables uniformly 
distributed on [0, 1], and then, given this sequence, for each pair {i,j) with 
i < j draw an edge ij with probability W{Xi, Xj), independently for all 
pairs (i,j) with i < j. Then the random graph G{n,W) converges to T]y 
a.s. as n ^ oo. 

If G is a graph, with V{G) = {1, . . . , v{G)} for simplicity, we define a func- 
tion Wg G Ws by partitioning [0, 1] into v{G) intervals li, i = 1, . . . ,v{G), 
and letting Wg be the indicator l[ij G E{G)] on /j x Ij. (In other words, 
Wg is a step function corresponding to the adjacency matrix of G.) We let 
7r(G) := T{Wg) denote the corresponding object in Uoo- It follows easily 



3. Graph limits 




(3.1) 
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from (3.1) tliat t{F,TT{G)) = t{F,G) for every graph F. In particular, if Gn 
is a sequence of graphs with v{Gn) — > oo, then Gn converges to some graph 
limit r if and only if 7r(G„) — > T in lioo. (Unlike [22j and \E\ we distinguish 
between graphs and limit objects and we do not identify G and vr(G), see 

cni.) 



3.2. Bipartite graphs and their limits. In the bipartite case, there are 
analoguous definitions and results (see [J^ for further details). We define 
a bipartite graph to be a graph G with an explicit bipartition V{G) = 
Vi{G)\JV2{G) of the vertex set, such that the edge set E{G) C Vi{G)xV2{G). 
Then we define t{F^ G) in the same way as above but now for bipartite graphs 
F, by letting </) = ((/>!, (/)2) be a pair of random mappings : Vj[F) — > 
Vj{G). We let B be the set of all unlabeled bipartite graphs and embed 
;B in a compact metric space B. A sequence {Gn) of bipartite graphs with 
vi{Gn)-,V2{Gn) — > OO Converges in B if and only if t{F,Gn) converges for 
every bipartite graph F. Let ^Soooo be the (compact) set of all such limits; 
we call the elements of ;Boooo bipartite graph limits. Every element of ^Soooo 
can be represented by a (non-unique) function W G W. We let T'^ £ Boooo 
denote the element represented by W and have, for every bipartite F 

KFXw)=l n ^(^Myj)da;i • • • dx^^(n dyi • • • dy^2(P). 

(3.2) 

Given W £ W and ni,n2 > 1, we define a random bipartite graph 
G{ni,n2,W) by an analogue of the construction in Subsection |3.1[ first 
choose two sequences Xi, X2, ■ ■ ■ , Xm and ¥±,¥2, . . . , ¥n2 of i.i.d. random 
variables uniformly distributed on [0,1], and then, given thess sequences, 
for each pair (i,j) draw an edge ij with probability W{Xi,¥j)^ indepen- 
dently for all pairs (i, j) G [ni x [722]. 

If G is a bipartite graph we define Wq £ W similarly as above (in general 
with different numbers of steps in the two variables; note that Wq now 



in general is not symmetric) and let t^{G) := T"{Wg)- Then, by (3.2), 
t(F,7r(G)) = t{F,G) for every bipartite graph F . Hence, if Gn is a sequence 
of bipartite graphs with vi{Gn),V2[Gn) — > cxd, then Gn converges to some 
bipartite graph limit T if and only if 7r(Gn) ^ T in ^B, 



-"oooo • 



3.3. Cut-distance. Borgs, Chayes, Lovasz, Sos and Vesztergombi |5| Sec- 
tion 3.4] define a (pseudo-)metric (5n on Ws called the cut- distance. This 
is only a pseudo-metric since two different functions in Ws may have cut- 
distance (for example, if one is obtained by a measure preserving transfor- 
mation of the other, see further [3] and [TO]), and it is shown in [5j that, in 
fact, 5u{Wi, W2) = if and only if t{F, Wi) = t{F, W2) for every graph F, 
i.e., if and only if = ™ ^00 • Moreover, the quotient space Ws/(5n, 
where we identify elements of Ws with cut-distance 0, is a compact metric 
space and the mapping W 1— > T\y is a homeomorphism of Ws/^n onto Uoo- 
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This extends to the bipartite case. In this case, we define 6'^ on W as 
(5n is defined in [5l Section 3.4], but allowing different measure preserving 
mappings for the two coordinates. Then, if we identify elements in W with 
cut-distance 0, W ^ T'^ becomes a homeomorphism of onto Boooo- 

Instead of repeating and modifying the complicated proofs from one can 
use their result in the symmetric case and define an embedding W W 
W into Ws by 



W{x,y) 



0, rc<l/2, y<l/2 

1, x>l/2, y>l/2 
\ + ^W{2x-l,2y), x>l/2, y<l/2 

[\ + lW{2y-l,2x), x<l/2,y>l/2 



It is easily seen that 6'^{Wi,W2) and 5d(VFi,VF2) are equal within some 
constant factors, for Wi,W2 G W, and that for each graph F, t{F,W) is a 
linear combination of t(Fi, W) for a family of bipartite graphs F (obtained 
by partitioning V{F) and erasing edges within the two parts). This and the 
results in [5], together with the simple fact that W i— > t{F, W) is continuous 
for 5g for every bipartite graph F, imply easily the result claimed. 

3.4. A reflection involution. If G is a bipartite graph, let be the 
graph obtained by interchanging the order of the two vertex sets; thus, 
Vj{G^) = V3-j{G) and E{G^) = {uv : vu £ E{G)}. We say that G^ is the 
reflection of G. Obviously, t{F,G'^) = t(F^^,G) for any bipartite graphs F 
and G. It follows that if Gn T £ B, then Gn for some € B, and 

this defines a continuous map of B onto itself which extends the map just 
defined for bipartite graphs. We have, by continuity, 

t{F, rt) = t{FlT), F£B,G£B. (3.3) 

Furthermore, F^^ = F, so the map is an involution, and it maps ;Boooo onto 
itself. 

For a function W on [0,1]^, let W'^{x,y) := W{y,x) be its reflection in 



the main diagonal. It follows from (|33| and (|0 that T"{W'<) = T"{W) 



it 



3.5. Threshold graph hmits. Let T := IJ^i be the family of all (un- 
labeled) threshold graphs. Thus T is a subset of the family U of all unlabeled 
graphs, and we define T as the closure of T in Z^, and 7^ := T\T = TnUoo, 
i.e., the set of proper limits of sequences of threshold graphs; we call these 
threshold graph limits. 

In the bipartite case, we similarly consider the set T" := |J^^ ^^^^ ^i,n2 
B of all bipartite threshold graphs, and let T" C B he its closure in B and 
7^ ^ := T'TiBoooc the set of proper limits of sequences of bipartite threshold 
graphs; we call these bipartite threshold graph limits. 

Note that T, 7^, T", ^ are compact metric spaces, since they are 
closed subsets oiU oi B. 
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We will give concrete representations of the threshold graph limits in 
Section |5j Here we only give a more abstract characterization. 

Recall that t(F, G) is defined as the proportion of maps V[F) — > V{G) 
that are graph homomorphisms. Since we only are interested in limits with 
v{G) — > oo, it is equivalent to consider injective maps only. By inclusion- 
exclusion, it is further equivalent to consider tind(-^) G), defined as the prob- 
ability that a random injective map V{F) V^G) maps F isomorphically 
onto an induced copy of F in G; in other words, tindC-f") G) equals the num- 
ber of labeled induced copies of -F in G divided by the falling factorial 
v{G) ■ ■ ■ {v{G) — v{F) + 1). Then tind(-^) ■) extends by continuity to U, and 
by inclusion-exclusion, for graph limits T £ Uoo, tind{F,T) can be written as 
a linear combination of t{Fi,T) for subgraphs Fi C F. We can define iind 
for bipartite graphs in the same way; further details are in [5] and |10] . 

Theorem 3.1. (i) Let T £ Uoo; ^-c, T is a graph limit. Then T G 7^ if 
and only i/tind(A,r) = tind(C4,r) = tind{'^K2,T) = 0. 

(ii) Let r G ^oooo/ ^-e., T is a bipartite graph limit. Then T £ oo ^/ 
and only if ti^d{'^K2,T) = 0. 



In view of (1.5) and (1.12), this is a special case of the following simple 
general statement. 

Theorem 3.2. Let T = {Fi, F2, . . . } he a finite or infinite family of graphs, 
and let lAj: (lU be the set of all graphs that do not contain any graph from 
T as an induced subgraph, i.e., 

Ur:={G£U: ti,,d{F, G) = for F £ J^}. 

LetUjr be the closure ofUj: inU. Then 

Ur-={T£U: tiUF, r)=0 for F£ T}. 

In other words, if T £ Uoo is a graph limit, then T is a limit of a sequence 
of graphs in Uj^ if and only if ti^^^F, T) = for F £ T . 

Conversely, if F £ Ujr n Uoo is represented by a function W, then the 
random graph G{n,W) £ Ujr (almost surely). 

The same results hold in the bipartite case. 

Proof. If G„ ^ r with G £ Ur, then t{F, T) = hm^^oo t{F, Gn) = for 
every F £ T.,hy the continuity of t[F, •). 

Conversely, suppose that T £ Uoo and t[F, F) = for F G .7^, and let T 



be represented by a function W. It follows from (3.1) that \i F £ T then 
Et(F, G(n, W)) = t{F, T) = 0, and thus t{F, G{n, W)) = a.s.; consequently 
G{n,W) £ Ujr a.s. This proves the second statement. Since G(n, VF) T 
a.s., it also shows that T is the limit of a sequence in Uj^, and thus T £ Uj^, 
which completes the proof of the first part. □ 

4. Degree distributions 

The results in this section hold for general graphs, they are applied to 
threshold graphs in section Section [5} 
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Let V be the set of probability measures on [0,1], equipped with the 
standard topology of weak convergence, which makes V a compact metric 
space (see e.g. Billingsley |3]). 

If G is a graph, let d{v) = dciy) denote the degree of vertex v G V[G)^ and 
let Dg denote the random variable defined as the degree dciv) of a randomly 
chosen vertex v (with the uniform distribution on V{G)). Thus < Dq < 
v{G) — 1. For a bipartite graph we similarly define Dq-j as the degree dciv) 
of a randomly chosen vertex v € Vj{G), j = 1,2. Note that < Dg-i < 
V2{G) and < Dg-2 ^ vi(G). Since we are interested in dense graphs, 
we will normalize these random degrees to Dg/v{G) and, in the bipartite 
case, Dg-i/v2{G) and Dg-2/vi{G)] these are random variables in [0,1]. The 
distribution of Dg/v{G) will be called the (normalized) degree distribution 
of G and denoted by i'{G) £ V; in other words, i^(G) is the empirical 
distribution function of {dG{v)/v{G) : v G V{G)}. In the bipartite case we 
similarly have two (normalized) degree distributions: i^i(G) for Vi{G) and 
z^2(G) for V2{G). 

The moments of the degree distribution(s) are given by the functional 
t{F, •) for stars F, as stated in the following lemma. We omit the proof, 
which is a straightforward consequence of the definitions. 

Lemma 4.1. The moments of u{G) are given by 

[\''du{G){t)=t{Ki,k,G), k>l, (4.1) 
Jo 

where Ki^k is a star with k edges. 

In the bipartite case, similarly, for k > 1, 

[\''dMG){t) = t{Ki^k,G), C t''du2{G){t) = t{Kk,i,G). (4.2) 
Jo Jo 

This enables us to extend the definition of the (normalized) degree distri- 
bution to the limit objects by continuity. 

Theorem 4.2. If G n are graphs with v(Gri) — > cxd and Gn — ^ T for some 
r £ U as n ^ oo, then v{Gn) ^(r) for some distribution i^(T) G V. 
This defines the 'degree distribution' i^iT) (uniquely) for every graph limit 



r G Uoo, and T ^ z/(r) is a continuous map U^o —>■ V . Furthermore, (4.1) 
holds for all G £U. 

Similarly, in the bipartite case, vi and U2 extend to continuous maps 
B ^ V such that (4.2) holds for all G £ B. Furthermore, J^2(r) = '^i(r^^) 
for r G F. 



Proof. An immediate consequence of Lemma 4.1 and the method of mo- 



ments. The last sentence follows from (4.2) and (3.3). □ 



Remark. Theorem 4.2 says that the degree distribution is a testable graph 
parameter in the sense of Borgs, Chayes, Lovasz, Sos and Vesztergombi [5 , 
see in particular [5, Section 6]. (Except that takes values in V instead of 
M.) 
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If r is represented by a function W on [0, 1] , we can easily find its degree 
distribution from W. 

Theorem 4.3. If W ^ Ws, then viVw) equals the distribution of W{U, y) dy, 
where U C/(0, 1). 

Similarly, in the bipartite case, if W £ W, then i/i(r'^) equals the distri- 
bution of Jq W{U, y) dy and i'2(r'\v) equals the distribution of W{x, U) dx. 



Proof. By (|4^ and ( |3.1[ ), 

[\''du{Twm=t{K,^k,rw)= [ ([ W{x,y)dy] dx 
Jo J[0,1] \J[0,1] J 



k 



= E^^ ^W{U,y) dyj 

for every A; > 1, and the result follows. The bipartite case is similar, using 

□ 

If a graph G has n vertices, its number of edges is 
\E{G)\ = \ dW = ^IEI?G = yIEpG/n) = y AdKG)(t). 

Hence, the edge density of G is 

\E{G)\/(^^ = ^ l\du{Gm. (4.3) 

If (Gn) is a sequence of graphs with v{Gn) — > oo and G„ ^ T G Uoo, 
we see from (4.3 1 and Theorem 4.2 that the graph densities converge to 
J^tdu{T){t), the mean of the distribution i^(T), which thus may be called 
the (edge) density of F G Uoo- 



If r is represented by a function W on [0, 1]^, Theorem 4.3 yields the 
following. 

Corollary 4.4. has edge density J f^^ ^2 W{x, y) dx dy for every W G 

Proof. By Theorem |4.3[ the mean of fi{Tw) equals 

E [ W{U,y)dy= [ [ W{x,y)dxdy. □ 
Jo Jo Jo 

5. Limits of threshold graphs 



Recall from Subsection 3.5 that T^o is the set of limits of threshold graphs, 
and 7^ ^ is the set of limits of bipartite threshold graphs. Our purpose in 
this section is to characterize the threshold graph limits, i.e. the elements of 
Toe and 7^ ^ , and give simple criteria for the convergence of a sequence of 
threshold graphs to one of these limits. We begin with some definitions. 
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A function W : [0, 1]^ ^ R is increasing if W{x, y) < W{x' , y') whenever 
< X < j;' < 1 and < y < y' < 1. A set S C [0, 1]^ is increasing if its 
indicator I5 is an increasing function on [0,1]^, i.e., if {x,y) G S implies 
(x', y') G S whenever < x < x' < 1 and < y < y' < 1. 

fi £ V, let be its distribution function F^{x) := /x([0,x]), and let 
Ffii^—) '■= /^([0,x)) be its left-continuous version. ThusF^(0— ) = < F^{0) 
and Ff,{l-) < 1 = Further, let F'^ : [0,1] [0,1] be the right- 

continuous inverse defined by 

F-\x) := sup{t < 1 : F^{t) < x}. (5.1) 

Note that F^^{0) > and F-^{1) = 1. Finally, define 

:= {{x,y) G [0, 1]^ : x > F^((l - y)-)}. (5.2) 

It is easily seen that is a closed increasing subset of [0,1]^ and that it 
contains the upper and right edges {(x, 1)} and {(1, y)}. Since x > F^{^{1 — 
y)—) ■^=^ F~^{x) > 1 — y, we also have 

= {ix,y) G [0,1]2 : F-\x) + y > l}. (5.3) 

We further write Wfj, := 1^^ and let F^ := F"(W^) and, when W is 
symmetric, F^ := F(W^). We denote the interior of a set 5* by 5°. It is 
easily verified from (5.2) that 

5° = {(x,y)G(0,l)2:x>F^(l-y)}. (5.4) 

Recall that the Hausdorjf distance between two non-empty compact sub- 
sets Ki and K2 of some metric space S is defined by 

dniKi, K2) := max(max (i(x, ^2)5 max (i(y, Ki)) . (5-5) 

x^Ki y&K2 

This defines a metric on the set of all non-empty compact subsets of 5. If 5 
is compact, the resulting topology on the set of compact subsets of S (with 
the empty set as an isolated point) is compact and equals the Fell topology 
(see e.g. [191 Appendix A. 2]) on the set of all closed subsets of S. 

Let Arf denote the Lebesgue measure in M*^. For measurable subsets ^i, ^2 
of [0,1]^, we also consider their measure distance A2(5'iAS'2). This equals 
the L^-distance of their indicator functions, and is thus a metric modulo 
null sets. 

For functions in W we also use two different metrics: the L^-distance 
I[o i]2 y) "^2(2;, y)| dx dy and, in the symmetric case, the cut-distance 

(5n defined by Borgs, Chayes, Lovasz, Sos and Vesztergombi [5j, and in the 
bipartite case its analogue S'^, see Section [s] Note that the cut-distance is 
only a pseudo-metric, since the distance of two different functions may be 0. 
Note further that the cut-distance is less than or equal to the L^-distance. 

We can now prove one of our main results, giving several related charac- 
terizations of threshold graph limits. There are two versions, since we treat 
the bipartite case in parallel. 
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The bipartite case. It is convenient to begin with the bipartite case. 

Theorem 5.1. There are bijections between the set ^ of graph limits of 
bipartite threshold graphs and each of the following sets. 

(i) The setV of probability distributions on [0,1]. 

(ii) The setCs of increasing closed sets S C [0, 1]^ that contain the upper 
and right edges [0, 1] x {1} U {1} x [0, 1]. 

(iii) The set Os of increasing open sets S C (0, 1)^. 

(iv) The set of increasing 0-1 valued functions W : [0, 1]^ ^ {0, 1} 
modulo a.e. equality. 

More precisely, there are commuting bijections between these sets given by 
the following mappings and their compositions: 





T" 




ii3p(r) 






V 






— Sfj.] 




Cb- 


►Ob, 




= S°; 


I'CW 


Cb- 


► Wb, 




= is; 




Ob- 






= is; 




m- 


T" 

00,00' 


iWB{W) 


— r" 



(5.6) 



l^BV 




In particular, a probability distribution fi £ V corresponds to G '^,00 '^^'^ 
to £ Cb, S° £ Ob, and G Wg. Conversely, T £ T^^^ corresponds to 
z/i(r) £ V. Thus, the mappings T 1^1 (F) and /i 1— > are the inverses of 
each other. 

Moreover, these bijections are homeomorphisms, with any of the following 
topologies or metrics: the standard (weak) topology on V; the Hausdorff 
metric, or the Fell topology, or the measure distance on Cb,' the measure 
distance on Ob; the L^-distance or the cut-distance on the set Wb- 



Proof. The mappings in (5.6) are ah well-defined, except that we do not 
yet know that iyyg maps Wg into 7^ ^ . We thus regard tyvB as a map 

Wb Boooo and let B := iwi3(W'B) be its image; we will identify this as 
7^ later. For the time being we also regard lbv as defined on B (or on 

ah of ;SoOOo)- 

Consider first ipc -V ^ Cb- By (5.2), 5^ determines at all continuity 
points, and thus it determines fi. Consequently, tpc is injective. 



22 



PERSI DIACONIS, SUSAN HOLMES, AND SVANTE JANSON 



S £ Cts and y £ [0, 1], then {x : (x, y) G 5} is a closed subinterval of [0, 1] 
that contains 1, and thus S = {{x,y) G [0, 1]'^ : x > g{y)} for some function 
g : [0,1] [0,1]. Moreover, g{l) = 0, g is decreasing, i.e. 5(2/2) < g{yi) 
if 2/1 < 2/2) and, since 5 is closed, g is right-continuous. Thus g{l — x) is 
increasing and left-continuous, and hence there exists a probability measure 
fj.£V such that Ff,{x-) = g{l- x), x £ [0, 1]. By K^, then 



ivcil^) =S^ = {(x, y) G [0, lf:x> 5(2/)} = 5. 

Hence Lpc is onto. Consequently, bpc is a bijection of V onto Cg. 

If 5i and ^2 are two different sets in Cg, then there exists a point (x, y) G 
\ 5*2, say. There is a small open disc with center in (x,y) that does 
not intersect S2, and since Si is increasing, at least a quarter of the disc is 
contained in 5i \ ^2. Hence, A2(5'iA52) > 0. Similarly, if 5i and 52 are 
two different sets in Os and (x, y) G \ ^2, then there is a small open disc 
with center in (x,y) that is contained in Si, and since 5*2 is increasing, at 
least a quarter of the disc is contained in 5i \ 5*2, whence A2(5'iA52) > 0. 
This shows that the measure distance is a metric on Cs and on Og, and that 
the mappings icw and iovv into Wb are injective (remember that a.e. equal 
functions are identified in Wg). 

Next, let S C [0, 1]^ be increasing. If (x, y) £ S with x < 1 and y < 1, it is 
easily seen that {x,x + 5) x (y,y + 5) C 5 for = min{l — x, 1 — y}, and thus 
(x, x + 6) X {y,y + 6) CI S°. It follows that, for any real o, the intersection of 
the boundary dS := S\S° with the diagonal line := {(x, x -|- a) : x G M} 
consists of at most two points (of which one is on the boundary of [0, 1]^). 
In particular, Xi{dS Ci La) = and thus 



X2{dS) = 2-^'^ j \i{dSr\La)da = {). (5.7) 

Consequently, dS is a null set for every increasing S. Among other things, 
this shows that if 5 G Cg, then ioW^co{S) = 15° = I5 a.e. Since elements 
of Wg are defined modulo a.e. equality, this shows that iovV-CO = ^CW • 

If G Wg, and thus W = Is for some increasing 5 C [0, 1]^, let 



S:=5U[0,1] x{l}U{l} X [0,1]. (5.8) 



Then S £ Cb and (5.7| implies that lcw{S) = Ig = Is = W a.e. Similarly, 
S° £ Ob and L,owiS°) = Is = W a.e. Consequently, tew and low are onto, 
and thus bijections. Similarly (or as a consequence), lco is a bijection of Cg 
onto Ob, with inverse S ^ S given by ( |5.8| ). 

Note that the composition icW-vc maps /x 1— > 1^^ = W^, and let tpg 
be the composition iyvBi-cwL-vc '■ ^ r"(VF^) = T'^ mapping V into ^oooo- 
Since tpc and lqw are bijections, its image /-g-p('P) = '-WB(W'g) = B £i ^oooo- 
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fi £ V, then the composition lbv i^VB^fj) = ^i^^'L) equals by Theorem 



4.3 



and (5.3) the distribution of 

f\s,{U,y)dy = F^\U). (5.9) 

JO 

As is well-known, and easy to verify using (5.1), this distribution equals /i. 
Hence, the composition itsv^VB is the identity. It follows that t-pg is injective 
and thus a bijection of V onto its image B, and that i^-p (restricted to B) is 
its inverse. 

We have shown that all mappings in (5.6) are bijections, except that we 
have not yet shown that B = 7^ ^ . We next show that the mappings are 
homeomorphisms . 

Recall that the topology on V can be defined by the Levy metric defined 
by (see e.g. [HI Problem 5.25]) 

'^l(/^i,/"2) := inf{e > : F^^{x - e) - e < F^^{x) < Ff^^{x + e) +e for all x}. 

(5.10) 

If /Ui,^2 S V with dLifJ-i, 1^-2) < e, it follows from (5.2) and (5.10) that if 
(x, y) G Sfj,^ and x,y < 1 — e, then 

F^,{{1 - y - e)-) <F^,{{l-y)-)+e<x + e 

and thus {x + e,y + e) £ • Considering also the simple cases x £ [1 — e, 1] 
and y £ [I — e,l], it follows that if {x,y) £ S'^j, then d(^{x,y), S^^) < \f2e. 
Consequently, by (5.5) and symmetry, 

which shows that L-pc is continuous if is given the topology given by the 
Hausdorff metric. 

The same argument shows that for any {xQ,yQ), the intersection of the 
difference S'^^AS'^j with the diagonal line La defined above is an interval of 
length at most dL{^i, fi2), and thus, by integration over a as in (5.7), 

X2{S^^AS^^) < 2dL(^i,/i2). 

Hence, L-pc is continuous also if C13 is given the topology given by the measure 
distance. 

Since V is compact and tpc is a bijection, it follows that Lpc is a home- 
omorphism for both these topologies on Cg. In particular, these topologies 
coincide on Cq. As remarked before the theorem, since [0,1]^ is compact, 
also the Fell topology coincide with these on Cg. 

The bijections lcOi t-cw and lqw are isometries for the measure distance 
on Cg and Oq and the L^-distance on Wg, and thus homeomorphisms. 
Furthermore, still using the L^-distance on Wg, it is easily seen from (3.2), 
as for the symmetric case in |22j . [5], that for every fixed bipartite graph 
F, the mapping W 1— > t{F,T'^) is continuous, which by definition of the 
topology in ;Boooo means that tyvB : W 1— > T'^ is continuous. Hence, the 
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bijection tyvB is a homeomorphism of the compact space Wb onto its image 
B. 

As said above, the cut-distance is only a pseudo-metric on W. But two 
functions in W with cut-distance are mapped onto the same element in 
Boooo, and since we have shown that is injective on Wg, it follows that 
the restriction of the cut-distance to is a metric. Moreover, the identity 
map on Ws is continuous from the L^-metric to the cut-metric, and since 
the space is compact under the former metric, the two metrics are equivalent 
on Wb- 

We have shown that all mappings are homeomorphisms. It remains only 
to show that B = T^ ,^. To do this, observe first that if G is a bipartite 
threshold graph, and we order its vertices in each of the two vertex sets 
with increasing vertex degrees, then the function Wq defined in Section [3] is 
increasing and belongs thus to Wg. Consequently, vr(G) = iwB(^G) £ ^■ 
If r e '?^_oo) then by definition there exists a sequence Gn of bipartite 
threshold graphs with vi{Gn)-,V2{Gn) — > oo such that Gn -^T mB. This 
implies that vr(G„) — > T in ^Soooo, and since vr(Gn) G B and B is compact 
and thus a closed subset of Boooo-, we find T ^ B. 

Conversely, if T e ;B, then T = iwBi^cw^S) for some set S G Cb- For 
each n, partition [0,1]^ into closed squares Qij of side 1/n, and let Sn 
be the union of all Qij that intersect S. Then 5"^ £ Cg, 5 C 5„ and 
dniSn, S) < \/2/n. Let Wn := l5„ = t'Cw{Sn) and let r„ := L^^B^Wn) G B. 
Since Lew and i>vB are continuous, Wn W := Is in Wg and r„ 
iWBiW) = r in B C B 

oooo- However, Wn is a step function of the form 
W{Gn) for some bipartite graph G„ with fi(G„) = t'2(G„) = n, and thus 
7r(Gn) = ^w,^ = Tn- Moreover, each Sn and thus each Wn is increasing, and 
hence G„ is a bipartite threshold graph. Since '/r(G„) = r„ ^ T in Boooo, it 
follows that Gn — > r in ;B, and thus T £ ^. 

Consequently, B = ^ , which completes the proof. □ 

Remark 5.1. Another unique representation by increasing closed sets is 
given by the family Cg of closed increasing subsets S of [0, 1]^ that satisfy 
S = S°; there are bijections Cg Ob and Ob C'jg given by S i— > 5° 
and S S. We can, again, use the measure distance on Cg, but not the 
Hausdorff distance. (For example, [0, 1] x [1 — e, 1] ^ in C'q as £ ^ 0.) 

Corollary 5.2. The degree distribution yields a homeomorphism T ^ i^i(r) 
of T^^oo onto V. 

Of course, F i— > J^2(F) = i^i(F^) yields another homeomorphism of T^oo 
onto V . To see the connection between these, and (more importantly) to 
prepare for the corresponding result in the non-bipartite case, we investigate 
further the reflection involution. 

If S" C [0, let St := {{x,y) : (y,x) G S} be the set S reflected in the 
main diagonal. Thus = l^- We have defined the reflection map f for 
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bipartite graphs and graph hmits, and for the sets and functions in Theo- 
rem 5.] [ii)|[iii)|[iv) and it is easily seen that these correspond to each other 
by the bijections in Theorem |5.1[ Consequently, there is a corresponding 
map (involution) ^ ^ olV onto itself too. This map is less intuitive than 



the others; to find it explicitly, we find from (5.2), (5.3) and S 



Sji that 



>F^.t((i-y)-) ^ {y,x)eS, 



and thus F^t ((1 -y)-)=l- F-\y) and 



FAt) 



l-F-i((l-t)-), 0<t<l. 



(5.11) 



This means that the graph of the distribution function is reflected about the 
diagonal between (0, 1) and (1, 0) (and adjusted at the jumps). 

The map f is continuous on P, by Theorem |5 . 1 1 and the obvious fact that 
S" I— > 5Ms continuous on, for example, C^. 

We let Vs := {l^ V : = ^l} = {iJL £ V : = S],} be the set of prob- 
ability distributions invariant under the involution f . Since f is continuous, 
Vs is a closed and thus compact subset of V. 



Remark 5.2. If G Vs-, let xq := 1 - inf{x : (x,x) G 5^}. Then ( [5^ and 
(5.4) imply that F^^xq—) < 1 — xq < F^(xo), and the restriction of to 
[0, Xo) is an increasing right-continuous function with values in [0,1 — xq] 



and this restriction determines F^{t) for x > xq too by (5.11). 

Conversely, given any xq G [0, 1] and increasing right-continuous F : 
[0, Xo) — > [0, 1 — Xo], there is a unique fi £Vs with F^{x) = F{x) for x < xo 
and F^(xo) > 1 - xq. 



Non-bipartite case. We can now state our main theorem for (non-bipartite) 
threshold graph limits. 

Theorem 5.3. There are bijections between the set 7^ of graph limits of 
threshold graphs and each of the following sets. 



The setVs of probability distributions on [0, 1] symmetric with respect 
to t- 

The set Cq- of symmetric increasing closed sets S C [0, 1]^ that con- 
tain the upper and right edges [0, 1] x {1} U {1} x [0, 1]. 
The set Oq- of symmetric increasing open sets S C (0, 1)^. 
The set W-r of symmetric increasing 0-1 valued functions W : [0, 1]^ — 
{0,1} modulo a.e. equality. 



(iii) 
(iv) 
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More precisely, there are commuting bijections between these sets given by 
the following mappings and their compositions: 











= u{T) 




Vs- 


>Ct, 




— Sfj.] 


l^CO 


Cr- 


-Or, 




= S°; 




Cr- 


- Wr, 




= is; 




Or- 


Wr, 




= is; 


iyvr 


Wt 


' -'oo ) 


iwr{W) 


= Tw- 



(5.12) 



In particular, a probability distribution fj, £ Vs corresponds to S 7^ and 
to Sfj, G Cr, S° G Or, and G Wr- Conversely, T £ 7^ corresponds to 
z^(r) G Vs- Thus, the mappings T i— > i/(r) and // i— > are the inverses of 
each other. 




Moreover, these bijections are homeomorphisms, with any of the following 
topologies or metrics: the standard (weak) topology on Vs C V ; the Hausdorff 
metric, or the Fell topology, or the measure distance on Cr; the measure 
distance on Or; the L^-distance or the cut-distance on the set Wr- These 
homeomorphic topological spaces are compact metric spaces. 

Proof. The mappings Lpci i^cOi '-CW) '■OW are restrictions of the correspond- 



ing mappings in Theorem 5.1 and it follows from Theorem 5.1 and the defini- 
tions that these mappings are bijections and homeomorphisms for the given 
topologies. The spaces are closed subspaces of the corresponding spaces in 



Theorem 5.1 , since f is continuous on these spaces, and thus compact metric 
spaces. 



The rest is as in the proof of Theorem 5.1 and we omit some details. It 
follows from Theorem 4.3 that the composition l-wbi-CW^VC '■ = 
is a bijection of Vs onto a subset T' of 7^ with tr-p as its inverse. 
It follows that these mappings too are homeomorphisms, and that the L^- 
distance and cut-distance are equivalent on Wr- 

To see that T' = Xx, we also follow the proof of Theorem 5.1 A minor 



complication is that if G G T is a threshold graph, and we order the vertices 
with increasing degrees, then Wg is not increasing, because Wg{x, x) = for 
all X since we consider loopless graphs only. However, we can define W*{G) 
by changing Wq to be 1 on some squares on the diagonal so that W*{G) is 
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increasing and thus G Wr, and the error || Wg'-VF*(G) H^i < l/v{G). 

If we define 7r*(G) := T{W*{G)) G T', we see that if [Gn) is a sequence of 
threshold graphs with v{Gn) — > oo, then for every graph F, by a simple 
estimate, see e.g. [5^ Lemma 4.1], 

|t(F,7r*(G0)-t(F,G„)| <e(F)||VF(G„)-Ty*(G„)||ii <e(F)MG„) ^0. 

(5.13) 

It follows that Gn — > r in Z// if and only if ■n*{Gn) ^ T in lioo- If T G 7^, 
then there exists such a sequence Gn T, and thus 7r*((S'„) — > F in Uoo, 
and since 7r*(Gn) G and T' is compact, we find T G T' . 

The converse follows in the same way. If F G T' , then T = iy[;T{W) for 
some function W G Wt- The approximating step functions Wn constructed 



in the proof of Theorem 5.1 are symmetric, and if we let W* by the modifi- 
cation that vanishes on all diagonal squares, W* = Wg„ for some threshold 
graph Gn, and for every graph F, 

t{F, Gn) = t{F, W*) = t{F, Wn) + o(l) = t{F, W) + o(l). 

Hence, G„ = T inU, and thus T G T^q. Consequently, T' = Too. 

Corollary 5.4. The degree distribution yields a homeomorphism T i— > i^(r) 
of Too onto the closed subspace Vs ofV. 

Remark 5.3. The fact that a graph limit T can be represented by a function 
Ty G Wr if and only if tind(A,r) = ti„d(G4,r) = tind(2K2,r) = 0, which 
by Theorem 3.1 is equivalent to the bijection Too ^ W-r in Theorem 5.3 is 



also proved by Lovasz and Szegedy |23) . 

We have described the possible limits of sequences of threshold graphs; 
this makes it easy to see when such sequences converge. 

Theorem 5.5. Let Gn be a sequence of threshold graphs such that v{Gn) 
OO. Then Gn converges inU asn ^ oo, if and only if the degree distributions 
^{Gn) converge to some distribution /i. In this case, fi € Vs and Gn T^. 



Proof. As in the proof of Theorem 5.3 G„ ^ T if and only if 7r*(G„) — > T in 



u{Tr*{Gn)) equals the distribution of Jq W*{Gn){U,y) dy, but 
variable differs by at most l/t;(G„) = o(l) from the random 



T' = Too, wh ich by Theorem 5.3 holds if and only if z^(7r*(G„)) i^(r). By 
Theorem 4.3 
this random 

variable WG„{U,y) dy, which has degree distribution i/{Gn)- The result 
follows. □ 

Theorem 5.6. Let Gn be a sequence of bipartite threshold graphs such that 
vi{Gn), V2{Gn) — > OO. Then Gn converges in B as n ^ oo, if and only if the 
degree distributions i^i(Gn) converge to some distribution fi. In this case, 
z^2(G„) fj) and Gn T'^. 



Proof. Gn r if and only if 7r(Gn) — > T in S = ;Boooo) which by Theorem 



holds if and only if i^i(7r(G„)) — > z^i(r). It follows from Theorem 4.3 that 



z^i(7r(G„)) = i^i(G„), and the result follows from Theorem 5.1 □ 



5.1 
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Remark 5.4. A threshold graph hmit T is, by Theorem |5 .31 determined by 
its degree distribution and the fact that it is a threshold graph limit. By 



Theorem 3.2 and Lemma 4.1 , T is thus determined by t{F, T) for F in the set 
{P4, C4, 2i^2) -^1,1) -^1,2) • • • }• Lovasz and Szegedy [23 have shown that in 
some special cases, a finite set of F is enough; for example, the limit defined 



by the function W{x, y) = l[x + y > 1] (see Example 1.3 and Figure [4| is the 
unique graph limit with t{P4,T) = t{Ci,T) = t{2K2,T) = 0, t{K2,T) = 1/2, 
t(P3,r) = l/3. 

6. Random threshold graphs 

We consider several ways to define random threshold graphs. We will only 
consider constructions with a fixed number n of vertices; in fact, we take 
the vertex set to be [n] = {1, . . . , n}, where n > 1 is a given parameter. By 
a random threshold graph we thus mean a random element of %i '■= {G S 
T : V{G) = [n]} for some n; we do not imply any particular construction or 
distribution unless otherwise stated. (We can regard these graphs as either 
labeled or unlabeled.) 

This section treats four classes of examples: a canonical example based 
on increasing sets, random weights examples, random attachment examples 
and uniform random threshold graphs. 

6.1. Increasing set. For any symmetric increasing S C [0, 1]^, we let W = 
Is and define Tn-s G{n, W) as in Section |3] In other words, we take i.i.d. 
random variables C/i, ...,[/« ~ U{0, 1) and draw an edge ij if ([/«, Uj) £ S. 

As said in Section [s] G{n,W) F^y, which in this case means that 
Tn;S ^(15) G Too- We denote F(l5) by F^ and have thus the following 



result, using also Theorem 4.3 



Theorem 6.1. As n ^ 00, Tn-s ~^ ^s- In particular, the degree distribu- 
tion I'iTn-s) ^(r^), which equals the distribution of 

ifs{U) := \{y : {U,y) G 5}| = ¥[{U,U') eS\U), (6.1) 
with U, U' ~ U{0, 1) independent. □ 



By Theorem |5.3[ this construction gives a canonical representation of the 
limit objects in Too, and we may restrict ourselves to closed or open sets as 
in Theorem 5.^ li)|[iii) to get a unique representation. We can obtain any 
desired degree distribution fj, £ Vs for the limit by choosing 5 = 5^. This 
construction further gives a canonical representation of random threshold 
graphs for finite n, provided we make two natural additional assumptions. 

Theorem 6.2. Suppose that (G„)^ is a sequence of random threshold graphs 
with V{Gn) = [n] such that the distribution of each Gn is invariant under 
permutations of [n] and that the restriction (induced subgraph) of Gn+i to 
[n] has the same distribution as Gn, for every n > 1. If further i^{Gn) — ^ fJ- 
as n 00, for some G "P, then fi £ Vs and, for every n, Gn = Tn-s^,- 
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5.5 



Proof. It follows from Theorem 5.5 that Gn — > T^. (To apply Theorem 
to convergence in probability, we can use the standard trick of considering 
subsequences that converge a.e., since every subsequence has such a subsub- 
sequence [191 Lemma 4.2].) 

If we represent a graph by its edge indicators, the random graph G„ can 
be regarded as a family of 0-1-valued random variables indexed by pairs 
(j,j), 1 < i < i < By assumption, these families for different n are con- 
sistent, so by the Kolmogorov extension theorem Theorem 6.16], they 
can be defined for all n together, which means that there exists a random 
infinite graph Goo with vertex set N whose restriction to [n] coincides (in 
distribution) with Gn- Moreover, since each G„ is invariant under permu- 
tations of the vertices, so is Goo, i-e.. Goo is exchangeable. By Aldous and 
Hoover [Ij, see also [20] and [10], every exchangeable random infinite graph 
can be obtained as a mixture of G(oo, W); in other words, as G(oo, W) for 
some random function W € Ws. In this case, the subgraphs Gn converge 
in probability to the corresponding random Tw, see Diaconis and Janson 
[lOj . Since we have shown that G„ converge to a deterministic graph limit 

r^, we can take W deterministic so it follows that Goo = G(oo, W) for some 



W G Ws; moreover, T^ = Tw, and thus we can by Theorem 5.3 choose 
W = Wfj_. (Recall that in general, W is not unique.) Consequently, 

Gn = G{n,W^)=Tn-s,. □ 



6.2. Random weights. Definition (1.1) suggests immediately the construc- 
tion 



1.6) 



Let Xi,X2, . . . , be i.i.d. copies of a random variable X, let f G M, and let 
Tn-x,t be the threshold graph with vertex set [n] and edges ij for all pairs 
ij such that Xi + Xj > t. (We can without loss of generality let t = 0, by 
replacing X hy X — t/2.) 

Examples |1.2| and |1.3| are in this mode. 

Let F{x) := F{X < x) be the distribution function of X, and let be 
its right-continuous inverse defined by 

F~^{u) := sup{x G M : F{x) < u}. (6.2) 



(Cf. (5.1), where we consider distributions on [0,1] only.) Thus — cxd < 
F^^{u) <ooifO<u< 1, while F~^{1) = oo. It is well-known that the 
random variables Xi can be constructed as F~^{Ui) with Ui independent 
uniformly distributed random variables on (0,1), which leads to the fol- 
lowing theorem, showing that this construction is equivalent to the one in 
Subsection 16.11 for a suitable set S. Parts of this theorem were found earlier 
by Masuda, Konno and co-authors |21| I26j. 

Theorem 6.3. Let S be the symmetric increasing set 

S := {(x, y) G (0, 1]^ : F-\x) + F'^y) > t}. (6.3) 

Then Tn-x,t = Tn-s for every n. 
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Furthermore, as n ^ oo, the degree distribution v{Tn-x,t) A* '^^^ thus 
Tn-x,t r^, where ji ^ is the distribution of the random variable 
l-F{t- X), i.e. 

;u[0,s] =P(l -F(t-X) < s), sG[0, 1]. 

Proof. Taking X-i = F^^{Ui), we see that 

there is an edge ij F'^{Ui) + F~^{Uj) > t {Ui, Uj) G S, 



(6.4) 



which shows that Tn-x,t = 



n:S- 



The remaining assertions now follow from Theorem 6.1 together with 
the calculation, with U,U' ~ C/(0,1) independent and X = F~'^(U), X' = 
F-\U'), 

ifsiU) = F{{U, U')e S\U) = F{F-\U) + F-\U') >t\U) 

= F{X + X' >t\X) = F{X' >t- X \ X) = 1- F{t- X). 

□ 



The set S defined in (6.3) is in general neither open nor closed; the cor- 
responding open set is 

S° = {(x, y) G (0, 1)2 : F-\x-) + F-\y-) > t}, 



and the corresponding closed set Sn in Theorem 5.3 can be found as 5*° from 



(5.8). If we assume for simplicity that the distribution of X is continuous, 
then, as is easily verified, 

5^ = {{x,y) G [0, 1]2 : F~\x) + F"\y) > t}, 

where we define F~^{1) = oo (and interpret oo + (— oo) = oo in case 
F~^{0) = — oo). We can use these sets instead of S in (6.3) since they 
differ by null sets only. 

6.3. Random addition of vertices. Preferential attachment graphs are 
a rich topic of research in modern graph theory. See the monograph |24j . 
along with the survey [30|. The versions in this section are natural because 
of 1(1.2) and the construction (1.7) 



Let Tn^p be the random threshold graph with n vertices obtained by adding 
vertices one by one with the new vertices chosen as isolated or dominating at 
random, independently of each other and with a given probability p £ [0,1] 
of being dominating. (Starting with a single vertex, there are thus n — 1 
vertex additions.) 

The vertices are not equivalent (for example, note that the edges li, i ^ 
1, appear independently, but not the edges m, i / n), so we also define 
the random threshold graph Tn,p obtained by a random permutation of the 
vertices in T„^p. (When considering unlabeled graphs, there is no difference 
between Tn^p and Tn^p.) 
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Remark 6.1. We may, as stated in (1.7)[ use different probabilities pi for 
different vertices. We leave it to the reader to explore this case, for example 
with Pi = f{i/n) for some given continuous function / : [0, 1] [0, 1]. 



Theorem 6.4. The degree distribution u(Tn^p) converges a.s. as n ^ oo to 

a distribution fip that, for < p < 1, has constant density {l—p)/p on (0,p) 
and p/{l — p) on {p, 1); is a point mass at and is a point mass at 1. 
In particular, fj,i/2 is the uniform distribution on [0,1]. 

Consequently, Tn,p T^^ €%o. 

Proof. Let Zn{t) be the number of vertices in {1, ... , [nt\ } that are added as 
dominating. It follows from the law of large numbers that n~^Zn{t) pt, 
uniformly on [0, 1], and we assume this in the sequel of the proof. 

If vertex k was added as isolated, it has degree ZniX) — Zn{k/n), since its 
neighbours are the vertices that later are added as dominating. Similarly, if 
vertex k was added as dominating, it has degree k — 1 + Zn{l) — Zn{k/n). 
Consequently, if is the (normalized) degree distribution of T„^p, and (j) is 
any continuous function on [0, 1], then 



1 " 

= -Y,{(t>{n-^ Zn{l) - n-^Zn{k/n))l[AZn{k/n) = 0] 
fc=i 

+ (f){n-^Zn{l)-n'^Zn{k/n) + {k-l)/n)l[AZn{k/n) = 1]). 

pt uniformly, and (p is uniformly continuous, it follows 



^ - k/n))l[AZn{k/n) = 0] 

k=l 

+ (t){p{l - k/n) + k/n)l[AZn{k/n) = 1]) + o(l) 

C (t>{p{l - t)) d(n-i [nt\ - n-'Zn{t)) 
Jo 

+ [ (l){p{l-t)+t)d{n-^Zn{t)) +o{l). 
Jo 



Since n ^Zn{t) 
that, as n ^ oo. 



/ (pit) dflnit) 

Jo 
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Since the convergence n ^Zn{t) pt implies (weak) convergence of the 
corresponding measures, we finally obtain, as n — > oo, 

(P{t)dfin{t)^ [ (t>{p{l-t)){l-p)dt+ [ ^{p{l-t)+t)pdt 
Jo Jo 

1-p p 

= / (j)(x) dx H / dx 

P Jo '^-pJp 

1 

(t){x)dfip{x), 

with obvious modifications ifp = Oorp = l. □ 




Figure 8. Two examples of the sets Sp] the one on the right 
shows the special case where p = 0.5. 



Let Sp := be the corresponding subset of [0, 1]^. If < p < 1, has 



the distribution function 



pX, 0<x<p, 
l-T^{l-x), p<x<l, 



(6.5) 



and it follows from ( |5.2[ ) that Sp is the quadrilateral with vertices (0,1), 
{1 — p,l — p), (1,0) and (1, 1), see Figure [sj In the special case p = 1/2, ftp 
is the uniform distribution on [0, 1], and Sp is the triangle {{x,y) G [0, 1]^ : 
x + y > 1} pictured in Figure |4] with vertices (0, 1), (1,0) and (1, 1). Finally, 
Sq consists of the upper and right edges only, and Si = [0, 1]^. 

Removing any vertex from Tn^p (and relabeling the remaining ones) yields 
Tn-i,p. It follows that the same property holds for Tn^p, so Tn^p satisfies the 

Since T„ „ has the same degree distribution as 



6.2 



assumptions of Theorem 

Tn^p, Theorems 6.2 and 6.4| show the following equality. 
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Corollary 6.5. If < p < 1 and n > 1, then Tn^p = Tn-Sp- 

Hence the random threshold graphs in this subsection are special cases 



of the general construction in Subsection 6.1 We can also construct them 



using random weights as in Subsection 6.2 



Corollary 6.6. If < p < 1 and n > 1, then T^^p = Tn-x,o, where X has 
the density 1 — p on (—1, 0) and p on (0, 1). 



Proof. A simple calculation shows that the set S given by (6.3) is the quadri- 



lateral Sp. □ 

We may transform X by a linear map; for example, we may equivalently 
take X with density 2(1— p) on (0, 1/2) and 2p on (1/2, 1), with the thresho 

t = 1. In particular, r„ = Tn;U,i: where U ~ f7(0, 1) as in Example 



1.3 



6.4. Uniform random threshold graphs. Let T„ be a random unlabeled 
threshold graph of order n with the uniform distribution studied in Section|2j 
Similarly, let be a random labeled threshold graph of order n with the 
uniform distribution. Although r„ and have different distributions, see 
Section [2] the next theorem shows that they have the same limit as n — > 00. 

Theorem 6.7. The degree distributions iy{Tn) and i'{T^) both converge in 

probability to the uniform distribution A on [0,1]. Hence, Tn ^— > T\ and 
rrL P 



A- 



By Subsection 



2.1 



Tn = T„ x/2) hence the result for unlabeled graphs 



follows from Theorem 16.4 

Proof. We use Theorem |2.4[ in fact, the proof works for random threshold 
graphs generated by Algorithm |2.5| for any i.i.d. random variables B2, -B3, . . . 
with finite mean, and any Bi. (In the case when B2 is always a multiple of 
some d > 1, there is a trivial modification.) Let (3 := Ei?2. 

The algorithm starts by choosing (random) block lengths Bi,B2, ■ ■ . until 
their sum is at least n, and then rejects them and restarts (Step 3) unless 
the sums is exactly n. It is simpler to ignore this check, so we consider the 
following modified algorithm: Take Bi, B2, ■ ■ ■ as above. Let Sk ■= Z]j=i Bj 
be their partial sums and let T(n) := min{A; : Sk > n}. Toss a coin to 
determine whether the first block is isolated or dominating, and construct a 
random threshold graph by adding T(n) blocks of vertices with Bi, . . . , B^(^n) 
elements, alternatingly isolated and dominant. 

This gives a random graph Gn with S^-f^n) vertices, but conditioned on 
Srin) = ^) we obtain the desired random threshold graph. (Cf. Theo- 
) Since P(S't-(„) = n) converges to 1//3 > by renewal theory. 



rem 



2.4 



it suffices to prove that z/(G„) — > A as n ^ 00. In fact, we will show that 
v{Gn) A if we first choose an infinite sequence Bi, B2, ■ ■ ■ and then let 
n — > 00. 
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Let := E2fe+i<m^2fc+i and 5"^ := T.2k<m^-ik be the partial sums 
of the odd and even terms. By the law of large numbers, a.s. Sn/n — ^ (3 
and S^/n Sfjn \(3. We now consider a fixed sequence (Bj)'^ 

such that these limits hold. Since S't-(„)_i < n < S't-^^), it follows, as is 
well-known, that n/T{n) /?, so r(n) = n/ [5 + o{n). 

Suppose for definitencss that the first block is chosen to be isolated; then 
every odd block is isolated and every even block is dominating. (In the 
opposite case, interchange even and odd below.) If i G {S2k, S2k+i\, then i 
belongs to block 2 A; + 1, so z is added as isolated, and the neighbors of i will 
be only the vertices added after i as dominating, i.e. Ufc<^<r(n)/2('^2^-i' 
and 

2k<2e<T(n) 

If instead i G (S'2fc-i, S2k], then i is also joined to all vertices up to S2k, and 
thus 

d{i)= Y B2e+ Yl ^2m = ^f(n) + ^r'(0- 

2l<T{n) 2i+l<T{i) 

Hence, if i is in an odd block, 

and if i is in an even block, similarly, 

= - H 1- oil). 

Now fix t G (0, 1/2) and let £ > 0. Then the following holds if n is large 
enough: If i is in an even block, then d{i)/n > 1/2 + o(l) > t. If z is in an 
odd block and i < ii := (1 — 2t — 2e)n, then d{i)/n = ^{n — i)/n + o(l) > 
t + £ + o(l) > t. If i is in an odd block and i > ^2 := (1 — 2t + 2£)n, 
then d{i)/n = ^{n — i)/n + o(l) < t — e + o(l) < t. Consequently, for 
large n, d{i)/n < t only if i is in an odd block {S2k, S2k+i], and in this case 
2A; + 1 > r(zi) is necessary and 2A; + 1 > r(z2) is sufficient. Hence, 

S?^„) - 5,%) < \{i : d{i)/n < t}\ < S% - S^.^y 
Since i/(G„)[0,i] = ■ d{i)/n < t}| and 

I^qO qO \ _ /^('r(n) - T{ij)) + o(n) _ n-ij + o{n) _ ^ ^ ^ , ^^^^ 
- l^(n) - -^rfe) j - ^ - ^ -t±e + o(l), 

it follows that 

t-e + o{l) < u{Gn)[0,t] <t + s + o{l). 

Since e is arbitrary, this shows that !/((?„) [0, t] t, for every t G (0, |). We 
clearly obtain the same result if the first block is dominating. 

For t G (i,l) we can argue similarly, now analysing the dominant blocks. 
Alternatively, we may apply the result just obtained to the complement of 
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Gm which is obtained from the same Bj by switching the types of the blocks. 
This shows that iy(G„)[0,t] ^ t for t G (|, 1) too. 

Hence, z^(Gn)[0, t\ ^ t for every t G (0, 1) except possibly |, which shows 
that v{Gn) X. □ 

7. Vertex degrees in uniform random threshold graphs 



We have seen in Theorem 6.7 that the normalized degree distributions 
v{Tn) and i^{T^) for uniform unlabeled and labeled random threshold graphs 
both converge to the uniform distribution on [0, 1]. This is for weak conver- 
gence of distributions in V, which is equivalent to averaging over degrees in 
intervals (an, bn); we here refine this by studying individual degrees. 

Let Nd{G) be the number of vertices of degree d in the graph G. Thus, Dq, 
the degree of a random vertex in G has distribution P(-Dg = d) = Nd/v{G). 
(Recall that v{G) is the distribution of Dg/v{G), see Section |4j) 

We will study the random variables Nd{Tn) and Nd{T^) describing the 
numbers of vertices of a given degree d in a uniform random unlabeled or 
labeled threshold graph, and in particular their expectations EA'^^(T„) and 
E A^d(r^); note that E A^d(r„)/n and E iVd(r^)/n are the probabilities that 
a given (or random) vertex in the random graph T„ or has degree d. By 
symmetry under complementation, 

Nd{Tn) = iV„-i_d(r„) and iVd(T„^) ^ 7V„_i_rf(T„^). 

Let us first look at A^O) the number of isolated vertices. (By symmetry, we 
have the same results for A^„_i, the number of dominating vertices). Note 
that, for every n > 2, P(iVo(T„) = 0) = P(A^o(7'i') = 0) = 1/2 by symmetry. 

Theorem 7.1. (i) For any n > 1, 

'2~J~\ 0<i<n-2, 
P(iVo(T„)=j) = |o, j = n-l, (7.1) 

^2-"+i, j = n. 

In other words, if X ^ Ge(l/2), then A'o(T„) = X'^, where X'^ := X„ if 
X < n — 1 and X'^ := n if X„ > n — 1. Furthermore, E A^o(^n) = Ij o-nd 
]^o{Tn) Ge(l/2) as n ^ oo, with convergence of all moments. 

(ii) P(A'o(T^) = j) = t{n, j) /t{n) , where t{n,j) is given by (2.7); in 



particular, if < j < n — 2, then 

with p = log2/(27r) 0.11. Hence, No{T^) Po(log2) as n ^ oo with 
convergence of all moments; in particular, KNq{T^) — > log 2. 

Proof, (i): A threshold graph has j isolated vertices if and only if the ex- 
tended binary code ai • • • a^, in Section [2] ends with exactly j O's. For a 
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random unlabeled threshold graph T„, the binary code 02 • • • is uniformly 
distributed, and thus (7.1) follows. The remaining assertions follow directly. 

(ii): In the labeled case, the exact distribution is given by (2.7 1, and the 
asymptotics follow by (2.4). Uniform integrabilit of any power Nq{T^)"^ 
follows by the same estimates, and thus moment convergence holds. □ 

For higher degrees, we begin with an exact result for the unlabeled case. 

Theorem 7.2. ENd(Tn) = 1 for every d = 0, . . . ,n - 1. 

Actually, this is the special case p = 1/2 of a more general theorem for the 



random threshold graph Tn,p defined in Subsection 6.3: (Cf. Theorem 6.4 



which is for weak convergence, but on the other hand yields an a.s. limit 
while we here study the expectations.) 

Theorem 7.3. Let 0<p<l. If q = 1 — p and X ^ Bin(n,p), then, for 
< d < n - 1, 



IEiVd(T„,p) 



p 



^)f{x < d). 

pJ 



Proof. We use the definition in Subsection |6.3[ (For the uniform case p = 
1/2, this is Algorithm 2.1 ) Let di be the degree of vertex i. Then, if ai • • • 
is the extended binary code of the graph, we have 

n 

di = {i- l)ai + ^ aj. 

j=i+i 

Since the are i.i.d. Be(;5) for i = 2,...,n, the probability generating 
function of di is 

n 

j=i+i 

Consequently, 

n n 



y^ENd{Tn,p) 



d 



X 



i=l 

{px + g)" 1 — ipx + qY 



1=1 



i=l 



P 



+ 



X — {px + q) ' ^ 1 — {px + q) 
{q/p) + {p/q - q/p){px + qY - {p/q)x''- 



1 



In the special case p= 1/2, this is (1 — x") /(I — x) = Y1^^=q ^'^^ which shows 
Theorem |7.2| by identifying coefficients. For general p. Theorem 7.3 follows 
in the same way. □ 



Recall that denotes the number of preferential arrangements, or sur- 
jection numbers, given in (2.5). 
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Theorem 7.4. (i) In the unlabeled case, for any sequence d = d{n) with 

< d < n — 1, Nii{Tn) — ^ Ge(l/2) with convergence of all moments. 

(ii) In the labeled case, let X^, < d < oo, have the modified Poisson 
distribution given by 



d = i) 




(Po(log2)=^)=7di^^, i>l, 

1 = 0, 



Id 



2 log 2 ' 

where 70 := log 2, 7^ := 2Rd(^og2Y^^ / d\ for d>l, and 700 := 1- Then, for 

every fixed d > 0, N^iT^) = ^d, o,nd for every sequence 

d = d{n) CO with n — d ^ 00, Nd(T^) = Nn-i-d(T^) as n ^ 00, 

in both cases with convergence of all moments. 
In particular, E Nd{T^) = EN^-i-diTa) 

converges to 7^ for every fixed 

d, and to joo = 1 if d ^ 00 and n — d ^ 00. 



In the labeled case we thus have, in particular, K Nq(T^) log 2 ~ 
0.69315, EiVi(r^) ^ 2(log2)2 w 0.96091, EN2{T^) 3(log2)3 « 0.99907, 
EN-siT^) f (log2)^ PS 1.00028. The values for degrees and 1 (and 
symmetrically n — 1 and n — 2) are thus substantially smaller than 1, which 
is clearly seen in Figure [7} (We can regard this as an edge effect; the vertices 



with degrees close to or n — 1 are the ones added last in Algorithm 2.5 
Figure [7] also shows an edge effect at the other side; there is a small bump 
for degrees arond n/2, which correspond to the vertices added very early in 
the algorithm; this bump vanishes asymptotically, as shown by Theorem |7.4[ 
we believe that it has height of order n"^/^ and width of order n^^'^, but we 
have not analyzed it in detail.) 



Proof. The cases d = and d = n — 1 follow from Theorem 7. 1 We may 



thus suppose 1 < d < n — 2. We use Algorithm 2.5 We know that vertices 
in each block have the same degree, while different blocks have different 
degrees; thus there is at most one block with degrees d. 

Let Pd{(-) be the probability that there is such a block of length i > 1, 
and that this block is added as isolated. By symmetry, the probability that 
there is a dominating block of length I with degrees d is p^-i-d and thus 

nNd = ^)=Pd{^)+Pn-i^d{^), ^>1. (7.2) 

If block j is an isolated block, then the degree of the vertices in it equals 
the number of vertices added as dominating after it, i.e., -Bj+i +Sj+3 + • • • + 
Bj+2k-i, if the total number r of blocks is j + 2A: — 1 or j + 2k. Consequently, 
there is an isolated block of length i with vertices of degree d if and only if 
there exist j > 1 and A; > 1 with 

• Bj = £, 

• block j is isolated, 

• Y^i=i Bj+2i-i = d, 

• T;i^~'B, = n or Tjl^B, = n. 
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Recall that Bi,B2, ■ ■ ■ are independent and that i?2, -B3, . . . have the same 
distribution while Bi has a different one. (The distributions differ between 
the unlabeled and labeled cases.) Let 

m m 

Sn--='^Bi and Sn ■= y^-Bi+i. 

i=l i=l 

Further, let 



u{n) = J2 = n)= ¥{Br = n) = = 

m=0 1=1 

00 

u{n) = J2nSm = n), 

m=0 

and recall that u(n) , n(n) 1/fj, := I/KB2 (exponentially fast) by standard 
renewal theory (for example by considering generating functions). For any 
m > j + 2k — 1, 

m k C „ . _ 

ER R ^ R J ^ni-l-k, J - i, 

Bi-B,-l^Bj+2i-i - 

i=l i=l {^m-l-k, J ^ ^, 

and it follows that, since Bi,B2,. ■ ■ are independent and we condition on 
Sr = n, 

^ 00 

^ ' k=i 

■ (P(5fc-i = n-i-d)+ ¥{Sk = n-£-d)) 

00 00 

j=2 k=l 

■ {F{Sj+k-2 = n-i-d)+ F{S,+k-i = n-i-d))] (7.3) 

In the double sum, f{Bj = i) = P(-B2 = ^) does not depend on j, so the 
sum is at most 

F{B2 = e)Yl F{Sk = d)2u{n - £ - d) = 2 ¥{B2 = i)u{d)u{n - i - d) 
k 

= 0{F{B2 = £)). 

Similarly, the first sum is 0{F{Bi = £)) = 0{F{B2 = £)), and it follows that 
Pd{£) = 0{F{B2 = £)) and thus, by ( [7^ , 

F{Nd = £) = 0{F{B2 = £)), (7.4) 

uniformly in n, d and £. This shows tightness, so convergence F^N^i = £) ^ 
F{X = £) for some non-negative integer valued random variable X and each 
fixed £>! implies convergence in distribution (i.e., for £ = Q too). Further, 
since all moments of B2 are finite, (7.4) implies that all moments KNJ^ are 
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bounded, uniformly in d and n; hence convergence in distribution implies 
that all moments converge too. In the rest of the proof we thus let £ > 1 be 
fixed. 

If d < n/2 it is easy to see that F{Sk-i = n-i-d)+F{Sk = n-l-d) = 
0[{n — i — = 0{n~^l'^\ uniformly in fc, so the first sum in (7.3) 

is 0{n-^l'^u[d)) = 0{n-^/'^). If d > n/2, we similarly have ¥{Sk = d) = 
0(d~i/2|^= 0(n-i/2) andthusthesumisO(n"V2^t(j^_£_j)) = 0(^-1/2^ _ 

Hence (7.3) yields 
p,{^) = 0{n-y^) + ^^§^f:nSk = d) 

^ ' k=l 

■ (J2 HSi = n-£-d)+ ^i^i = n-e-d)y (7.5) 

i=k i=k+l 

The term with i = k can be taken twice, just as the ones with i > k, since 
IP(5fc = d) ^{Sk = n — £ — d) = 0(n~^/^) by the same argument as for 

the first sum in (7.3). Further, for i > k, Si — Sk = Si-k and is independent 
of Sk', thus 

F{Si = n-l-d) = ¥{Si^k = n-£-d-Sk) = IEP(5i_fc = n-£-d-Sk \ Sk) 



and Y^Zk ^{Si = n-£-d) = Eu{n-£-d- Sk). Hence, (jT^ yields 

Pd{i) = '^^^^ ^ V nSk = d)Eu{n-£-d-Sk)+0 (n- V2) . (7.6) 
u[n) ^-^ ' 



k=l 



If d is fixed, then E u{n — £ — d — Sk) ^ l-i ^ hy dominated convergence as 
n — > 00 for each /c, and thus (|7.6|) yields, by dominated convergence again. 



Pd{£) ^ F{B2 = £)Y1 nSk = d) = u{d) F{B2 = £). (7.7) 

k=l 

If d ^ 00, we use the fact that u{m) — l[m > 0]fj.^^ is summable over Z 
to see that 

Eu(n - £ - d - Sk) - fi~^F(n - £ - d - Sk > 0) = 0(maxF(Sk = m)), 

m 

which tends to as /c ^ co; on the other hand, F{Sk = d) ^ for every 
fixed k. It follows that ( |7.6| ) yields 

00 

Pd{£) = P(52 = ^) ^ nSk = d)F{Sk<n-£-d) + 0(1). 

k=l 

If Td := min{A; : Sk > d}, and S'f, denotes a copy of 5a,. independent of {Sj}^, 
then 

00 

^P(5fc = d) F{Sk <n-£-d)= u{d) F{S'^^ <n-£-d\Sr^ = d). 

k=l 
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It is easy to see that, with := Var(i?2), as d — > oo, 

((5; - d)/Vd I Sr, =d) = ((^;^ - Sr,)/Vd I 5., =d)^ N{0, 2c7V/i), 

cf. [H] (the extra conditioning on Sr^ = d makes no difference). Hence, 
when d 00, 

Pdii) = r{B2 = i)u{d)<^{{n - I - 2d)/Vd) + 0(1). 

(By ( |7.7p , this holds for fixed d too.) We next observe that <I>((n — £ — 
2d)/Vd) = <^{{n- 2d)/^/nj2) + o(l); this is easily seen by considering 
separately the three cases d/n a £ [0,1/2), d/n — > a G (1/2,1], and 
d/n 1/2 and (n - 2d)l^fnl2 b G [—CO, 00] (the general case follows by 
considering suitable subsequences). Hence, we have when d ^ 00, recalling 
that then u{d) 

Pd{£) = fi-^ F{B2 = £)<^{{n - 2d)/vW2) + o(l). 



For fixed d, this implies that Pn~d-i{^) ~^ 0, and thus (7.2) and (7.7) 
yield 

P(iVd = £)= Pd{£) + Pn-l^d{i) = U{d) P(B2 =£)+ oil). 

Similarly, if d — > cxd and n — d ^ 00, 

F{Nd = £)=Pd{£)+Pn-l-d{£) 

= fl-^F{B2=£){'^{{n-2d)/^/^) +<^{{2d + 2-n)/^/^)) +o{l) 
= /i-ip(52 =£) +0(1). 

We have thus proven convergence as n — > 00, with all moments, A'^^ — ^ 
Xfi for fixed d and A^^^; — ^ X^o for d = d{n) 00 with n — d ^ 00, where 

W{Xd = £) = u{d) ¥{B2 = £) = 2u{d) ¥{B* =£), £>l, (7.8) 

nxd = ^) = i-nxd>i) = i-u{d), (7.9) 

for 1 < d < 00, with u{oo) := fi~^. 

In the unlabeled case, B2 = {B* \ B* > 1) = B* + 1 with B* ~ Ge(l/2). 
Consider a random infinite string 0102 • • • of i.i.d. Be(l/2) binary digits, and 
define a block as a string of m > O's followed by a single 1. Then -Bj+i, 
J > 1, can be interpreted as the successive block lengths in 0102 • • • , and thus 
u{d) is the probability that some block ends at d, i.e., u{d) = P(arf = 1) = 

1/2, for every d > 1. It follows from ([7^-([7^ that = 5* ~ Ge(l/2) 
for every d> 1, and (i) follows. 
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In the labeled case, when B* Po(log2), we use generating functions: 



1 _ e(x-i)iog2 - 2 - e^i°g2 ~ 2^ dl °^ 



d=0 



where we recognize the gererating function (2.6). Thus, u{d) = Rd{log2) /dl. 
(A direct combinatorial proof of this is also easy.) 
We let, using fi:=EB2 = EB*/F{B* > 1) = 2 log 2, 

7d ■.= EXd = u{d)EB2 = fiu{d) = 2log2u{d) = 2Rd{\og2f^^ / d\ 

and note that 7^ — > 700 = 1 as c? ^ 00 since u{d) fJ-^^, or by the known 
asymptotics of Rd [12, (11.16)]. The description of in the statement now 



follows from (7.8 )-(^7.9p. □ 



8. Random bipartite threshold graphs 

The constructions and results in Section [6] have analogues for bipartite 
threshold graphs. The proofs are simple modifications of the ones above and 
are omitted. 

8.1. Increasing set. For any increasing S C [0,1]^, define T„i,n2;5 •= 
G{ni,n2,ls)- In other words, take i.i.d. random variables U[, . . . ,U^_^, 
[/(', ~ U{0, 1) and draw an edge ij if (f//, C/j') G S. 

Theorem 8.1. As ni,n2 — > 00, T„j^„2;5 ~^ ^'s- -^'^ particular, the degree 
distribution vi{Tn-s) ~^ ^i^'s)' '^hich equals the distribution of ips{U) 



defined by (6.1). □ 



As in Section [6j this gives a canonical representation of random bipartite 
threshold graphs under natural assumptions. 

Theorem 8.2. Suppose that {Gni,n2)ni,n2>i Q'^e random bipartite threshold 
graphs with Vi{Gni,n2) = [^1] and V2{Gni,n2) = [^^2] such that the distri- 
bution of each Gni,n2 "^^ invariant under permutations of V\ and V2 and 
that the restrictions (induced subgraphs) of Gn'^+i,n2 <^f^d Gni,n2+i to V{G) 
both have the same distribution as G„j^„2> f^''" every ni,re2 > 1. If further 
ui{Gni,n2) A* ni,n2 00, for some ^ G V, then, for every ni,n2, 

Gnx,n2 — '^ni,n2;S^- D 



8.2. Random weights. Definition (1.9) suggests the following construc- 
tion: 

(8.1) Let X and Y be two random variables and let t S M. Let Xi, X2, ■ ■ ■ , 
be copies of X and ¥±,¥2, . . . , copies of ¥, all independent, and let 
Tni,n2;X,Y,t ^he bipartite threshold graph with vertex sets [ni] and 
[722] and edges ij for all pairs ij such that Xi + ¥j > t. 
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Theorem 8.3. Let S be the increasing set 

S := {{x, y) G (0, 1]^ : F^\x) + Fy\y) > t}. (8.2) 

Then Tn^,n2;X,Y,t = Tnun2;S for every ni,n2 > 1. 

Furthermore, as ni,n2 — > oo, the degree distribution i'i{Tn^^n2;X,Y,t) 
and thus Tn^^n2;X,Y,t r'^, where ^ G V is the distribution of the random 
variable 1 — Fy^t — X), i.e. 

//[0,s] =P(1 -Fy(t-X) < s), sG[0,l]. (8.3) 

□ 



In the special case when ¥{X e [0, 1]) = 1, y ~ [7(0, 1) and t = 1, (8.3) 



yields fi[0, s] = ¥{X < s), so is the distribution of X; further, the set S in 



^.2) is a.e. equal to Sn in (5.3). 



Corollary 8.4. If ji G Vs, let X have distribution ji and let Y ~ [7(0,1). 

ThenTn^^n2;X,Y,t ^ ^ni,n2;5^ for every ui, 712 > 1- Furthermore, asni,n2 oo, 

Z^l(7ni,n2;X,y,t) and Tn-^,n2;X,Y,t T'^- 

This yields another canonical construction for every fi £ V. (We claim 
only convergence in probability in Corollary |8.4[ convergence a.s. holds at 
least along every increasing subsequence (ni(m), n2(m)), see Remark 
8.2].) 



8.3. Random addition of vertices. Definition (1.10) suggests the follow- 
ing construction: 

(8.4) Let r„j^„2;pi,P2 be the random bipartite threshold graph with ni +712 
vertices obtained as follows: Take ni 'white' vertices and n2 'black' 
vertices, and arrange them in random order. Then, join each white 
vertex with probability pi to all earlier black vertices, and join each 
black vertex with probability p2 to all earlier white vertices (other- 
wise, the vertex is joined to no earlier vertex), the decisions being 
made independently by tossing a biased coin once for each white 
vertex, and another biased coin once for each black vertex. 

Let, for pi,p2 G [0, 1], /Up^^pj be the probability measure in V with distri- 
bution function 

U ~ ~^)' P2<X<1. 

Hence, /ipi,p2 has density (1 -pi)/p2 on (0,^2) and pi/{l - P2) on (p2,l); 
if P2 = there is also a point mass 1 — pi at 0, and if p2 = 1 there is 
also a point mass pi at 1. It follows from (5.2) that the corresponding 
subset 5'pi,p2 •= '^Mpi,P2 Pil]^ is the quadrilateral with vertices (0,1), 
(1 — pi, 1 — P2)i (1)0) and (1, 1) (including degenerate cases when pi or p2 
is or 1). 
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This is an extension of the definitions in Subsection 6.3; we have /Xp.p = fip 
id Sp^p = 
if pi = p2. 



and Sp^p = Sp. Note also that //pi,p2 = /^P2,pi- particular, /ipi,p2 ^ "^s only 



Theorem 8.5. As 711,71-2 — > oo, the degree distributions ^i{Tni^n2;pi,p2) — * 
t^pi,P2 o.iT'd i^2(?ni,n2;pi,p2) ^ t^p2,pij Consequently, 7ni,7i2;pi,p2 ^ ''^pi:P2 

r" G T" 

Aipi,P2 00,00- 

Corollary 8.6. Ifpi,p2 G [0,1] andni,n2 > 1, i/ie77 

-'ni,n2;pi,P2 ~ -'rii,n2;5pj^,p2 ~ -'ni,n2;-'fi,X2,05 

where Xj has the density 1 — pj on (—1, 0) and pj on (0, 1), j = 1,2. 

Note that if pi +p2 = 1, then 5'p^,p2 is the upper triangle S1/2 ■= {(x, y) : 
X + y > 1}. Hence the distribution of 7'ni,n2;pi,p2 does not depend on pi as 
long as p2 = 1 — Pi- In particular, we may then choose pi = 1 and p2 = 0. 
In this case. Definition |( 8. 4) simplifies as follows. 

(8.6) Let r„j^„2 the random bipartite threshold graph with ni + n2 
vertices obtained as follows: Take ni 'white' vertices and n2 'black' 
vertices, and arrange them in random order. Join every white vertex 
to every earlier black vertex. 

If pi = 1 and p2 = 0, then further Xi = [/ ~ U{0, 1) and X2 = f/ - 1 
in Corollary |8.6[ Hence, we have found a number of natural constructions 
that yield the same random bipartite threshold graph. 

Corollary 8.7. If pi e [0,1] and ni,n2 > 1, then 

-^ni,n2;pi,l— pi — -^ni,n2;l,0 — ^ni,n2 — -^ni,n2;5i/2 — -^ni,n2;?7,f/,l; 

withU r^U {0,1). □ 

We will see in the next subsection that this random bipartite threshold 
graph is uniformly distributed as an unlabeled bipartite threshold graph. 

8.4. Uniform random bipartite threshold graphs. It is easy to see that 
for every bipartite threshold graph, if we color the vertices in Vi white and 
the vertices in V2 black, then there is an ordering of the vertices such that 
a white vertex is joined to every earlier black vertex but not to any later. 
(For example, if there are weights as in (1.9)[ order the vertices according to 
w'^ and w'j , taking the white vertices first in case of a tie.) This yields a 1- 
1 correspondence between unlabeled bipartite threshold graphs on ni + 7^2 
vertices and sequences of ni white and 712 black balls. Consequently, the 
number of unlabeled bipartite threshold graphs is 

/rii + n2\ 

'^ni,n2 = , "1,712 > 1. 

V ^1 / 

Moreover, it follows that Tn^^n2 is uniformly distributed in '7^i,n2; hence 
Corollary 8.7 yields the following: 
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Theorem 8.8. The random bipartite threshold graphs r„^_n2! Trii,n2;pi,i-pi 
(0 < pi < I), Tnj^^n2]Si/2' '^ni,n2;U,u,i all Uniformly distributed, regarded 
as unlabeled bipartite threshold graphs. □ 

We have not studied uniform random labeled bipartite threshold graphs. 

9. Spectrum of Threshold Graphs 

There is a healthy literature on the eigenvalue distribution of the adja- 
cency matrix for various classes of random graphs. Much of this is focused 
on the spectral gap (e.g., most /c-regular graphs are Ramanujan [5]). See 
Jakobson, Miller, Rivin, Rudnick [18 for evidence showing that random k- 
regular graphs have the same limiting eigenvalue distribution as the Gauss- 
ian orthogonal ensemble. The following results show that random threshold 
graphs give a family of examples with highly controlled limiting spectrum. 

There is a tight connection between the degree distribution of a threshold 
graph and the spectrum of its Laplacian, see [28l [TU |29j. Recall that the 
Laplacian of a graph G, with V{G) = [n], say, is the nxn matrix C = D — A, 
where A is the adjacency matrix of G and D is the diagonal matrix with 
entries da = dcii)- (Thus C is symmetric and has row sums 0.) It is easily 
seen that {Cx, y) = 'Ylij<^E(G)^^i~^i)^yi~yo) ^lU ^ I^"- The eigenvalues 
Aj of C satisfy < Aj < n, i = 1, . . . , n, and we define the normalized spectral 
distribution £ V as the empirical distribution of {Aj/n}"^^. 

For a threshold graph, it is easily seen that if we order the vertices as in 



(1.2) and Subsection 2.1 then for each i = 2, . . . ,n the function 

fiU) ■= 



-1, J < i, 
- 1, J = i, 



0, j > i. 

is an eigenfunction of C with eigenvalue d(i) or d{i)+l, depending on whether 
i is added as isolated or dominating, i.e., whether = or 1 in the binary 
code of the graph. Together with cpi := 1 (which is an eigenfunction with 
eigenvalue for any graph), these form an orthogonal basis of eigenfunctions. 
The Laplacian spectrum thus can be written 

{0}Ll{d{i)+ai:i = 2,...,n}. (9.1) 

In particular, the eigenvalues are all integers. 



Moreover, (9.1) shows that the spectrum {Aj}" is closely related to the 
degree sequence; in particular, asymptotically they are the same in the sense 
that if Gn is a sequence of threshold graphs with v{Gn) — c>o and ji £ V, 
then 

ydGn) -^fi ^ u{Gn) ^ ^. (9.2) 

(See p6] for a detailed comparison of the Laplacian spectrum and the degree 



sequence for threshold graphs.) In particular. Theorem 5.5 can be restated 
using the spectral distribution: 
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Theorem 9.1. Let Gn be a sequence of threshold graphs such that v{Gn) — > 
cxD. Then Gn converges in lA as n ^ oo, if and only if the spectral distri- 
butions yiiGn) converge to some distribution fx. In this case, ^ £ Vs and 

Gn ^ r^. □ 

Remark 9.1. It can be shown that the spectrum and the degree sequence 



are asymptoticahy close in the sense that (9.2 ) holds for any graphs Gn with 



v{Gn) oo, even though in general there is no simple relation like (9.1) 



Another relation between the spectrum and the degree sequence for a 
threshold graph is that their Ferrers diagrams are transposes of each other, 



see |28^ |29] : this is easily verified from (9.1) by induction. If we scale the 
Ferrers diagrams by n, so that they fit in the unit square [0, 1]^ with a corner 
at (0, 1), then the lower boundary is the graph of the empirical distribution 
function of the corresponding normalized values, i.e., the distribution func- 
tion of z^(G) or vl[G). Hence, these distribution functions are related by 



reflection in the diagonal between (0,1) and (1,0), so by (5.11) (and the 
comment after it), for any threshold graph G, 

vl(G) = u{G)^. 
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